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Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in 
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act 
as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnik etal.. Proc. Natl. Acad. Sci. USA 83:4143-4146 [\9V6]). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by 
5 uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in 
the cells of the intended host. Techniques suitable for the transfer of nucleic acid into rnammalian cells in vitro 
include the use of liposomes, electroporation, micromjecnon, cell fusion, DEAE-dextran, the calcium phosphate 

1 0 precipitation method, etc . The currently preferred in vivo gene trans fcr techniques include transfection with viral 
(typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau el ah, Trends in 
Biotechnology 11. 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with 
an agent that targets the target cells, such as an antibody specific for a cell surface membrane protein or the 
target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind 

15 to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate 
uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which 
undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half- life. 
The technique of receptor-mediated endocytosis is described, for example, by Wu et ah, J. Biol. Chem. 262, 
4429-4432 (1987); and Wagner et ah, Proc. Nath Acad. Sci. USA 87, 3410-3414 (1990). For review of gene 

20 marking and gene therapy protocols see Anderson et ah, Science 256, 808-813 ( 1992). 

The PRO polypeptides described herein may also be employed as molecular weight markers for protein 
electrophoresis purposes. 

The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are 
useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 

25 markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently 
available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker. 

The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue 
typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue as 
compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern 

30 analysis, Southern analysis and Western analysis. 

The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO 
polypeptides of the present invention can be formulated according to known methods to prepare phannaceutically 
useful compositions, whereby the PRO product hereof is combined in admixture with a phannaceutically 
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient 

35 having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 
(Remington's Pharmaceutical Sciences 16th edition. Osoh A. Ed. (1980)), in the form of lyophilized 
formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the 
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dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, 
such as serum albumin, gelatin or irnmunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino 
acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other 
carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as 
5 mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, 
PLTJRONICS™ or PEG. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes, prior to or following lyophilizauon and reconstitution. 

Therapeutic compositions herein generally are placed into a container having a sterile access port, for 

10 example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical 
administration, or by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 

15 vary depending on the particular use envisioned. The determination of the appropriate dosage or route of 
administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance 
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be 
performed following the principles laid down by Mordenti , J. and Chappell, W. "The use of interspecies scaling 
in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press, New 

20 York 1989, pp. 42-96. 

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal 
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, 
preferably about 1 /ig/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos, 

25 4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different 
treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may 
necessitate delivery in a manner different from that to another organ or tissue. 

Where sustained-release administration of a PRO polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO 

30 polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant 
proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon- 
(rhIFN- ), interleukin-2, and MN rgp!20. Johnson et al., Nat. Med.. 2:795-799 (1996); Yasuda, Biomed. 
Ther. . 27:1221-1223 (1993); Hora et al., Bio/Technology. 8:755-758 (1990); Cleland, "Design and Production 
of Single Immunization Vaccines Using Polylactide Porygiycolide Microsphere Systems," in Vaccine Design: 

35 The Subunit and Adjuvant Approach. Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; 
WO 97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 
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The sustained-release formulations of these proteins were developed using poly-lactic -coglycolic acid 
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation 
products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the 
degradabiliry of this polymer can be adjusted from months to years depending on its molecular weight and 
composition. Lewis, "Controlled release of bioacrive agents from lactide/glycoiide polymer, " in: M. Chasm and 
5 R. Langer (Eds.), Biodegradable Polymers as Drue Delivery Systems (Marcel Dekker: New York, 1990), pp. 
Ml. 

This invention encompasses methods of screening compounds to identify those that mimic the PRO 
polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides 

10 encoded by the genes identified herein, or otherwise interfere with the interaction of the encoded polypeptides 
with other cellular proteins. Such screening assays will include assays amenable to high- throughput screening 
of chemical libraries, making them particularly suitable for identifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

15 All assays for antagonists are common in that they call for contacting the drug candidate with a PRO 

polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the 

20 drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 
attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of 
the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific 
for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed 
by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 

25 component, e.g., the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
immobilized on the surface indicates that complexing occurred. Where the originally non- immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled antibody 

30 specifically binding the immobilized complex. 

If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by 
a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co 
immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein- 

35 protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song, Nature (London). 340:245-246 (1989); Chien et ah, Proc. Natl. Acad. Sci. USA . 88:9578- 
9582 (1991)) as disclosed by Chevray and Nathans, Proc. Natl. Acad. Sci. USA. 89: 5789-5793 H99H. Many 
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transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting 
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast 
expression system described in the foregoing publications (generally referred to as the "two-hybrid system") 
takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to 
the DNA-binding domain of GALA, and another, in which candidate activating proteins are fused to the 
5 activation domain. The expression of a GAL 1 -toe Z reporter gene under control of a GAM-activated promoter 
depends on reconstitution of GAM activity via protein-protein interaction. Colonies containing interacting 
polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete kit 
(MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two- 
hybrid technique is commercially available from Clontech. This system can also be extended to map protein 
10 domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 
these interactions. 

Compounds that interfere with the interaction of a gene encoding a PRO polypeptide identified herein 
and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared 
containing the product of the gene and the intra- or extracellular component under conditions and for a time 

15 allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 
inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a 
placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) 
between the test compound and the intra- or extracellular component present in the mixture is monitored as 
described hereinabove. The formation of a complex in the control reaction(s) but not in the reaction mixture 

20 containing the test compound indicates that the test compound interferes with the interaction of the test compound 
and its reaction partner. 

To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be 
screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence 
of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively, 

25 antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound 
PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition 
assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide 
molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene 
encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, 

30 ligand panning and FACS sorting. Coligan et al. t Current Protocols in Immun.. 1(2): Chapter 5 (1991). 
Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to 
the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS 
cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass 
slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means 

35 including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are 
prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a 
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single clone that encodes the putative receptor. 

As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity- 
linked with cell membrane or extract preparations that express the receptor molecule. Cross -linked material is 
resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, 
resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
5 from micro- sequencing would be used to design a set of degenerate oligonucleotide probes la screen a cDNA 
library to identify the gene encoding the putative receptor. 

In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of 
the compound to enhance or block this interaction could then be measured. 

10 More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 

iinmunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and 
monoclonal antibodies and antibody fragments, single-chain antibodies, ami- idiotypic antibodies, and chimeric 
or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. 
Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO 

15 polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 
PRO polypeptide. 

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using 
amisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used 

20 to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be 
complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.. 

25 6:3073 (1979); Cooney et al., Science . 241: 456 (1988); Dervan et al., Science . 251:1360 (1991)), thereby 
preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide 
hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide 
(antisense - Okano, Man^teflL, 56:560 (1991); Oligodeoxvnucleotides as A ntisense Inhibitors of Gene 
Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered 

30 to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO 
polypeptide. When antisense DNA is used, oligcdeoxyribonucleotides derived from the translation-initiationsite, 
e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred. 

Potential antagonists include small molecules that bind to the active site, the receptor binding site, or 
growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological 

35 activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides 
or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic 
compounds. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 
Ribozymesact by sequence-specifichybridizationto the complementary target RNA, followed by endonucleolytic 
cleavage. Specific ribozyrae cleavage sites within a potential RNA target can be identified by known techniques. 
For further details see, e.g. , Rossi, Current Biology. 4:469-471 (1994), and PCT publication No. WO 97/33551 
(published September 18, 1997). 
5 Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 

and composed of deoxy nucleotides. The base composition of these oligonucleotides is designed such that it 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 
97/33551, supra, 

10 These small molecules can be identified by any one or more of the screening assays discussed 

hereinabove and/or by any other screening techniques well known for those skilled in the art. 

PR0189 can be used in assays with W01A6.1 of C. Elegans, phosphodiesterases, transporters and 
proteins which bind to fatty acids, to determine the relative activities of PRO 189 against these proteins. The 
results can be applied accordingly. 

15 

F. Anti-PRO Annies 
The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

20 I. Polyclonal Anybodies 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent 
and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 

25 immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in die mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine 
thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

30 The immunization protocol may be selected by one skilled in the an without undue experimentation. 

2. MonoclonaJ Antibodies. 
The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 
prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature . 256:495 (1975). 
35 In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. 



365 



WO 99/63088 



PCT/US99/12252 



The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen 
cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a 
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59- 103 J. 
5 Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine 
and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or 
survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine 
guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
10 include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk 

15 Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor , J. Immunol. . 133 : 300 1 ( 1 984) ; Brodeur et al . , Monoclonal 
Antibody Prod uction Techniques and Applications. Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of 

20 monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are 
known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem.. 107:220 (1980). 

25 After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 

procedures and grown by standard methods [Goding, supral . Suitable culture media for this purpose include, 
for example, Dulbecco's Modified Eagle 's Medium and RPM1- 1640 medium. Alternatively, the hybridoma cells 
may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture 

30 medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein 
A-Sepharose, hydroxy lapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinant DNA methods, such as those described 
in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated 
and sequenced using conventional procedures (e.g. , by using oligonucleotide probes that are capable of binding 

35 specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the 
invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression 
vectors, which are then transfected into host cells such as simian COS cells. Chinese hamster ovary (CHO) cells, 
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or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the 
coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences 
[U.S. Patent No. 4,816,567; Morrison et al., supral or by covalently joining to the inunuroglobulin coding 
sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non- immunoglobulin 
5 polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for 
the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent 
antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well 
known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and 
10 modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent 
heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid 
residue or are deleted so as to prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 
produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known 
15 in the art. 

3. Human anfl H umani7 *d Antibodies 
The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, 

20 immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 

25 some instances , Fv framework residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody 
nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are 

30 those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones 
etal.. Nature. 321 :522-525(1986): Riechmannetal., Nature, 332:323-329 (1988); and Presta, Curr. Op. Struct. 
Biol.. 2:593-596 (1992)1. 

Methods for humanizing non-human antibodies are well known in the an. Generally, a humanized 
35 antibody has one or more amino acid residues introduced into it from a source which is non-human. These non- 
human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" 
variable domain. Humanization can be essentially performed following the method of Winter and co-workers 
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[Jones et al., Nature , 221:522-525 (1986); Riechmann et al., Nature, 112:323-327 (1988); Verhoeyen et al., 
Science . 239:1534-1536 (1988)1, by substituting rodent CDRs or CDR sequences for the corresponding 
sequences of a human antibody. Accordingly, such " humanized " antibodies are chimeric antibodies (U.S. Patent 
No. 4,8i6,567), wherein substantially less than an intact human variable domain has been substituted by the 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 

5 antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the art, including phage 
display libraries [Hoogenboom and Winter, J. Mo). Biol.. 227:381 (1991); Marks et al., J.Mol.Biol., 222:581 
(1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human 

10 monoclonal antibodies (Cole et a)., Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, p. 77 (1985) and 
Boerner et al., J. Immunol. . 147H) :86-95 (1991)1. Similarly, human antibodies can be made by introducing 
of human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin 
genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and 

15 antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in ihe following scientific publications: Marks et al., 
Bio/Technology 10. 779-783 (1992); Lonberg exal.. Nature 368 856-859(1994); Morrison, Nature 368,812-13 
(1994); Fishwild et al, Nature Biotechnology 14 . 845-51 (1996); Neuberger, Nature Biotechnology M, 826 
(1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 65-93 (1995). 

20 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the 
PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 
25 subunit. 

Methods for making bispecific antibodies are known in the an. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/li ght-cham 
pairs, where the two heavy chains have different specificities IMilstein and Cuello, Mature, 305 :537-539 ( 1 983)] . 
Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
30 produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific 
structure. The purification of the correct molecule is usually accomplished by affinity chromatography steps. 
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBQ 
L t 10:3655-3659(1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can 
35 be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin 
heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to 
have the first heavy-chain constant region (CHI) containing the site necessary for light-chain binding present in 
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at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et al.. Methods 
m Enzvmologv. 121:210(1986). 

According to another approach described in WO 96/27011, the interface between a pair of antibody 
5 molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant 
cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. 
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar 
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large 

10 amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing 
the yield of the heterodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab') 2 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical 

15 linkage. Brennan et at. , Science 229:81 ( 1985) describe a procedure wherein intact antibodies are proteolytically 
cleaved to generate F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicinal dithiols and prevent intcrmolecular disulfide formation. The Fab' 
fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab* -thiol by reduction with mercaptoemylamine and is mixed with an 

20 equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 

Fab 1 fragments may be directly recovered from E. coli and chemically coupled to form bispecific 
antibodies. Shalaby et at., J. Exp. Med. 175:217-225 (1992) describe the producuon of a fully humanized 
bispecific antibody F(ab*>2 molecule. Each Fab* fragment was separately secreted from E. coli and subjected 

25 to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 
culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. 

30 Kostelny at., J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun 
proteins were linked to the Fab* portions of two different antibodies by gene fusion. The antibody homodimers 
were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 
This method can also be utilized for the producuon of antibody homodimers. The "diabody™ technology 
described by Hollinger et a/., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative 

35 mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (VJ by a linker which is too short to allow pairing between the 
two domains on the same chain. Accordingly, the V H and V, domains of one fragment are forced to pair with 
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the complementary V L and V H domains of another fragment, thereby forming two antigen-binding sites. Another 
strategy for making bispecific antibody fragments by the use of single -chain Fv (sFv) dimers has also been 
reported. See, Gruber etal., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies arc' contemplated. For example, trispecific antibodies can be prepared. 
Tult et al. t J. Immunol. 147:60 (1991). 
5 Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein. 

Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule 
on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28. or B7), or Fc receptors for IgG 
(FcyR). such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD 16) so as to focus cellular defense mechanisms 
to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize 
10 cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding 
arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPT A, DOT A, 
or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor 
(TF). 

15 5. Heteroconiueate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
antibodies are composed of two covalently joined antibodies. Such antibodies have , for example, been proposed 
to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection 
[WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro 

20 using known methods in synthetic protein chemistry, including those involving crosslinking agents. For 
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaplobutyriraidate and 
those disclosed, for example, in U.S. Patent No. 4,676,980. 

25 6. Effector Function Engineering 

It may be desirable to modify the antibody of the invention with respect to effector function, so as to 
enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be 
introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 
homodimeric antibody thus generated may have improved internalization capability and/or increased 

30 complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al, L. 
Exp Med .. 176: 1191-1195 (1992) and Shopes. J. Immunol .. 148 : 2918-2922 (1992). Homodimeric antibodies 
with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in 
Wolff et at. Cancer Research. 5J: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual 
Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al. , Anti- 

35 Cancer PrM PesiM1. 3 : 219-230 (1989). 
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7. Immunoconiueates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin (e.g. , an enzymaucaHy active toxin of bacterial, fungal, plant, or animal 
origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. 
Enzymatic ally active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active 
fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, aipha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytoiaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, cure in, crotin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, restrictocin, phenomycin, enoraycin, and the tricothecenes. A variety of radionuclides are 
available for the production of radioconjugated antibodies. Examples include 21J Bi, l3l In f 91> Y. and IW Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling 
agents such as N-succiiumidyl-3-(2.pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional 
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), 
aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 
diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared as described in Vitetta et al. , Science . 228: 1098 (1987). Carbon- 14-labeled 1- 
isotWocyanatobenzyl-3-methyIdiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radiomicleotide to the antibody. See W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in rumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed 
by removal of unbound conjugate from the circulation using a clearing agent and then administration of a 
"ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radiomicleotide). 

8. Immunolroosomes 
The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing 
the antibody are prepared by methods known in the an, such as described in Epstein et al., Proc. Nafl. Acad. 
Sci. USA. &: 3688 (1985); Hwang et al., Proc. Natl Acad. Sci. USA. 77: 4030 (1980); and U.S. Pat. Nos. 
4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 
5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprismg phosphatidyl^ 

PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
Fab* fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 
et al .. J. Biol. Chem. . 257 : 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent 
(such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al , J. National Cancer Inst., 
Si(19): 1484(1989). 
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9. Pharmaceutical Compositions of Antibodies 
Antibodies specifically binding a PRO polypeptide identifiedherein, as well as other molecules identified 
by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the 
form of pharmaceutical compositions. 

If the PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing 
5 antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable- 
region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 

10 technology. See, e.g., Marasco et al.. Proc. Natl. Acad. Sci. USA. 90: 7889-7893 (1993). The fbnnuiation 
herein may also contain more lhan one active compound as necessary for the particular indication being treaied, 
preferably those with complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic 
agent, cytokine, chemotherapeiuic agent, or growth-inhibitory agent. Such molecules are suitably present in 

15 combination in amounts that are effective for the purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and 
poly-(methylmethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsions. 

20 Such techniques are disclosed in Remington's Pharmaceutical Sciences , supra. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers conuining the antibody, which matrices are in 

25 the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poiy(vinylalcohol)), polylactides 
(U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON DEPOT ™ (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poty-D-(-)-3- 

30 hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release 
of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated 
antibodies remain in the body for a long time, they may denature or aggregate as a result of exposure to moisture 
at 37°C, resulting in a loss of biological activity and possible changes in immunogenicity. Rational strategies 
can be devised for stabilization depending on the mechanism involved. For example, if the aggregation 

35 mechanism is discovered to be intermolecularS-S bond formation through thio-disulfide interchange, stabilization 
may be achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture 
content, using appropriate additives, and developing specific polymer matrix compositions. 



372 



WO 99/63088 



PCT/US99/12252 



G. Uses for anti-PRO Antibodies 

The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may 
be used in diagnostic assays for PRO, e.g. , detecting its expression in specific cells, tissues, or serum. Various 
diagnostic assay techniques known in the an may be used, such as competitive binding assays, direct or indirect 
sandwich assays and inununoprecipiiation assays conducted in either heterogeneous or homogeneous phases 
5 [Zola, Monoclonal Antibodies: A Manual of Techniques. CRC Press, Inc. (1987) pp. 147-158]. The antibodies 
used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be capable 
of producing, either directly or indirectly, a detectable signal. For example, the detectable moiety may be a 
radioisotope, such as 3 H, 14 C f 32 P, M S, or I23 l, a fluorescent or chemiluminescent compound, such as fluorescein 
isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or 

1 0 horseradish peroxidase . Any method known in the art for conjugating the antibody to the detectable moiety may 
be employed, including those methods described by Hunter et al., Nature . 144:945 (1962); David et al.. 
Biochemistry. 13:1014 (1974); Painetal., J. Immunol. Meth.. 40:219 (1981); and Nygren, J. Histochem. and 
Cvtochem.. 3&:407 (1982). 

Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture 

15 or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a 
Sephadex resin or filter paper, using methods well known in the art. The immobilized antibody then is contacted 
with a sample containing the PRO to be purified, and thereafter the support is washed with a suitable solvent that 
will remove substantially all the material in the sample except the PRO, which is bound to the immobilized 
antibody. Finally, the support is washed with another suitable solvent that will release the PRO from the 

20 antibody. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

All patent and literature references cited in the present specification are hereby incorporated by reference 
in their entirety. 

25 

EXAMPLES 

Commercially available reagents referred to in the examples were used according to manufacturer's 
instructions unless otherwise indicated. The source of those cells identified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, 
30 va. 

EXAMPLE 1 : Extracellular Domain Homology Scre ening to Identify Novel Polypeptides and cDNA Encoding 
Therefor 

The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 
35 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST 
databases included public databases (e.g., Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ™, 
Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program WU-BLAST-2 
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(Altschul et al., Methods in Enzvmologv 266:460-480 (19%)) as a comparison of the ECD protein sequences 
to a 6 frame translation of the EST sequences. Those comparisons with a Blast score of 70 (or in some cases 
90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences 
with the program "phrap" (Phil Green, University of Washington, Seattle, WA). 

Using this extracellular domain homology screen, consensus DNA sequences were assembled relative 
5 to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were 
often (but not always) extended using repeated cycles of WU-BLAST-2 and phrap to extend the consensus 
sequence as far as possible using the sources of EST sequences discussed above. 

Based upon the consensus sequences obtained as described above, oligonucleotides were then 
synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as 

10 probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR 
primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100- 
1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional 
oligonucleotides are synthesized when the consensus sequence is greater than about 1 - 1 .5kbp. In order to screen 
several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per 

15 Ausubel et al. , Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was then 
used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs. 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using 
commercially available reagents such as those from lnvitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 

20 appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al . , Science . 
251:1278-1280 (1991)) in the unique Xhol and NotI sites. 

EXAMPLE 2 : Isolation of cDNA clones by Amylase Screening 
25 1. Preparation of oligo dT primed cDNA library 

mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San 

Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector 

pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). 

In this procedure, die double stranded cDNA was sized to greater than 1000 bp and the Sall/NotI tinkered cDNA 
30 was cloned into XhoI/NotI cleaved vector. pRK5D is a cloning vector that has an sp6 transcription initiation 

site followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 

2. Preparation of random primed cDNA library 

A secondary cDNA library was generated in order to preferentially represent the 5* ends of the primary 
35 cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to 
generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from Life 
Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA 
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was sized to 500-1000 bp, tinkered with blunt to NotI adaptors, cleaved with Sfil, and cloned into Sfil/Notl 
cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding 
the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal) 
followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this 
vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately 
5 transfected yeast colonies. 

3. Transformation and Detection 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then 

10 electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml) was 
added and the mixture was incubated at 37 °C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (37 °C). Positive colonies were 
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl- 
gradient. The purified DNA was then carried on to the yeast protocols below. 

15 The yeast methods were divided into three categories: (1) Transformation of yeast with the 

plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR 
amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further 
analysis. 

The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT 
20 alpha, ura3-52, leu2-3, leu2-112, his3-ll, his3-15, MAL+, SUC\ GAL + . Preferably, yeast mutants can be 
employed that have deficient post-translational pathways. Such mutants may have translocation deficient alleles 
in seel I, secll y jcc62, with truncated 5«:71 being most preferred. Alternatively, antagonists (including 
antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins 
implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) 
25 or the complex formation of these proteins may also be preferably employed in combination with the amylase- 
expressing yeast. 

Transformation was performed based on the protocol outlined by Gietz et al . , Nucl. Acid. Res.. 2Q: 1425 

( 1992). Transformed cells were then inoculated from agar into YEPD complex media broth ( 100 ml) and grown 

overnight at 30° C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast Genetics . 
30 Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then diluted to 

about 2 x 10 6 cells/ml (approx. 00600=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 10 7 cells/ml 

(approx. OD 600 =0.4-0.5). 

The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a 

Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, 
35 and centrifuged again in 50 ml falcon tubes at 3.500 rpm in a Beckman GS-6KR centrifuge. The supernatant 

was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl, 1 mM EDTA 

pH 7.5, 100 mM Li^OOCCHj), and resuspended into LiAc/TE (2.5 ml). 
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Transformation took place by mixing the prepared cells ( 100 jd) with freshly denatured single stranded 
salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transforming DNA (1 /ig, vol. < 10 p\) in 
microfuge tubes. The mixture was mixed briefly by vortexing. then 40% PEG/TE (600 pi 40% polyethylene 
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li 2 OOCCH 3 , pH 7.5) was added. This mixture was 
gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42 "C 
5 for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and 
resuspended into TE (500 jxl, 10 mM Tris-HCI, 1 mM EDTA pH 7.5) followed by recentrifugation. The cells 
were then diluted into TE (1 ml) and aiiquots (200 /d) were spread onto the selective media previously prepared 
in 150 mm growth plates (VWR). 

Alternatively , instead of multiple small reactions , the transformation was performed using a single, large 
10 scale reaction, wherein reagent amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser et al. ( Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 
208-210 (1994). Transformams were grown at 30°C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red starch in the selective 
15 growth media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described by 
Biely et al., Anal. Biochem.. 172 : 176-179 (1988). The coupled starch was incorporated into the SCD-Ura agar 
plates at a final concentration of 0. 15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50- 
100 mM final concentration). 

The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in 
20 order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase 
secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were 
determined by their ability to break down starch resulting in a clear halo around the positive colony visualized 
directly. 

25 4. Isolation of DNA bv PCR Amplification 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile 
water (30 fd) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent 
analysis or immediately amplified. An aliquot of cells (5 mD was used as a template for the PCR reaction in a 
25 til volume containing: 0.5 /d Klentaq (Clontech, Palo Alto, CA); 4.0 pi 10 mM dNTP's (Perkin Elmer- 
30 Cetus); 2.5 jd Kentaq buffer (Clonlech); 0.25 fil forward oligo 1 ; 0.25 pi reverse oligo 2; 12.5 pi distilled water. 
The sequence of the forward oligonucleotide 1 was: 

5^TflTAAAACGACGGCCAGT TAAATAGACCTGCAATTATTAATCT- 3' (SEQ ID NO: 3) 
The sequence of reverse oligonucleotide 2 was: 

S'-rAGGAAACAGCTATGACC ACCTGCACACCTGCAAATCCATT> 3' (SEQ ID NO:4) 
35 PCR was then performed as follows: 

a. Denature 92 °C, 5 minutes 

b. 3 cycles of: Denature 92°C, 30 seconds 
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AlUlMU 

Extend . 


59°C, 
72°c! 


30 seconds 
60 seconds 


c. 


3 cycles of: 


Denature 

Anneal 

Extend 


57°C; 
72°C, 


92°C, 30 seconds 
30 seconds 
60 seconds 


d. 


25 cycles of: 


Denature 

Anneal 

Extend 


55°C, 
72°C, 


92°C, 30 seconds 
30 seconds 
60 seconds 


e. 




Hold 




4°C 



The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase 
region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present. 
15 Typically, the first 18 nucleotides of the 5* end of these oligonucleotides contained annealing sites for the 
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, 
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences. 

Following the PCR, an aliquot of the reaction (5 pi) was examined by agarose gel electrophoresis in 
a 1% agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., supra. 
20 Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing 
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chatsworth, CA). 

EXAMPLE 3 : Isolation of cDNA Clones Using Signal Algorithm Analysis 

Various polypepuae-encoding nucleic acid sequences were identified by applying a proprietary signal 

25 sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as 
clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, lncyte 
Pharmaceuticals, Inc.. Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal 
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine 
codon(s) (ATG) at the 5 '-end of the sequence or sequence fragment under consideration. The nucleotides 

30 following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first 
ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate 
sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, 
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven 
sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted 

35 in the identification of numerous polypeptide -encoding nucleic acid sequences. 

EXAMPLE 4 : Isolation of cDNA clones Encoding Human PRQ281 

In order to obtain a cDNA clone encoding PR0281, methods described in Klein et al., Proc. Natl, 
/lead. Sci. USA 93:7108-7113 (1996) were employed with the following modifications. Yeast transformauon 
40 was performed with limiting amounts of transforming DNA in order to reduce the number of multiple 
transformed yeast cells. Instead of plasmid isolation from the yeast followed by transformation of E. coli as 
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described in Klein et al., supra. PCR analysis was performed on single yeast colonies. PCR primers employed 
were bipartite in order to amplify the insert and a small portion of the invertase gene (allowing to determine that 
the insert was in frame with invertase) and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0281 , was determined 
5 to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
nucleotide sequence of PR0281. A full length plasmid library of cDNAs from human umbilical vein 
endothelium tissue was titered and approximately 100,000 cfu were plated in 192 pools of 500 cru/pool into 96- 
well round bottom plates. The plates were sealed and pools were grown overnight at 37°C with shaking 
(200rpm). PCR was performed on the individual cultures using primers. Agarose gel electrophoresis was 

10 performed and positive wells were identified by visualization of a band of the expected size. Individual positive 
clones were obtained by colony lift followed by hybridization with 33 P-labeled oligonucleotide. These clones 
were characterized by PCR, restriction digest, and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 80-82, and a stop signal at nucleotide positions 1115-1117 

15 (Figure 1, SEQ ID NO:l). The predicted polypeptide precursor is 345 amino acids long, has a calculated 
molecular weight of approximately 37,205 daltons and an estimated pi of approximately 10.15. Analysis of the 
full-length PR0281 sequence shown in Figure 2 (SEQ ID NO: 2) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 14, multiple transmembrane domains from about 
amino acid position 83 to about amino acid position 105, from about amino acid position 126 to about amino acid 

20 position 146, from about amino acid position 158 to about amino acid position 177, from about amino acid 
position 197 to about amino acid position 216, from about amino acid position 218 to about amino acid position 
238, from about amino acid position 245 to about amino acid position 265, and from about amino acid position 
271 to about amino acid position 290 and an amino acid sequence block having homology to G- protein coupled 
receptor proteins from about amino acid 1 15 to about amino acid 155. Clone UNQ244 (DNA 16422- 1209) has 

25 been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209929. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 2 (SEQ ID NO: 2), evidenced significant 
homology between the PR0281 amino acid sequence and the following Dayhoff sequences: H 64634, 
AF033095_1, B64815, YBHL_ECOLI, EMEQUTR l, AF064763J, S53708, A69253, AF035413J2 and 

30 S63281. 

EXAMPLE Mfltipn, of cPNA clones Encoding Human PRQ275 

In order to obtain a cDN A clone encoding PR0276, methods described in Klein et al. , PNAS. 92:7108- 
71 13 (1996) were employed with the following modifications. Yeast transformation was performed with limiting 
35 amounts of transforming DNA in order to reduce the number of multiple transformed yeast cells. Instead of 
plasmid isolation from the yeast followed by transformation of E. coli as described in Klein et al., supra, PCR 
analysis was performed on single yeast colonies. PCR primers employed were bipartite in order to amplify the 
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insert and a small portion of the invertase gene (allowing to determine that the insert was in frame with invertase) 
and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR027(i f was determined 
to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
5 nucleotide sequence of PR0276. A full length plasmid library of cDN As from human fetal liver cells was titered 
and approximately 100,000 cfu were plated in 192 pools of 500 cfu/pool into 96- well round bottom plates. The 
plates were sealed and pools were grown overnight at 37 C with shaking (200rpm). PCR was performed on the 
individual cultures using primers. Agarose gel electrophoresis was performed and positive wells were identified 
by visualization of a band of the expected size. Individual positive clones were obtained by colony lift followed 
10 by hybridization with M P-labeled oligonucleotide. These clones were characterized by PCR, restriction digest, 
and southern blot analyses. 

A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 180-182 and a stop signal at nucleotide positions 933-935 
(Figure 3; SEQ ID NO:5). The predicted polypeptide precursor is 251 amino acids long has a calculated 
15 molecular weight of approximately 28,801 daltons and an estimated pi of approximately 9.58, The 
transmembrane domains are approximately at amino acids 98- 1 16 and 152-172 of the sequence shown in Figure 
4 (SEQ ID NO:6). Clone DNA16435-1208 (UNQ243) has been deposited with the ATCC and is assigned 
ATCC deposit no, 209930 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
20 alignment analysis of the full-length sequence shown in Figure 4 (SEQ ID NO: 6), revealed some sequence 
identity between the PR0276 amino acid sequence and the foi lowing Dayhoff sequences: CEG25D7 2, 
ATT805J2, S69696, GRHRRAT, NPCB AABCD3 , AB013149J, P R85942 and AP000006_5. 

EXAMPLE 6: Isolation of cDNA clones En coding Human PRQ189 

25 A clone designated herein as DNA141 87 was isolated as described in Example 2 above from a human 

retina tissue library. The DNA14187 sequencer show to Figure 7 (SEQ ID NO: 9). Based on the DNA 14 187 
sequence shown in Figure 7 (SEQ ID NO:9), oligonucleotides were synthesized: I) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PRO 189. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 

30 often designed to give a PCR product of about 100- 1000 bp in length. The probe sequences are typically 40-55 
bp in length. In order to screen several libraries for a full-length clone, DNA from the libraries was screened 
by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. 
A positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide 
and one of the primer pairs. 

35 A pair of PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 '-TTG ACCTATAC AG AGATTC ATC-3 ' (SEQ ID NO: 10); and 
reverse PCR primer 5 '-CT AAGAACTTCCCTC AGG ATTTT-3 ' (SEQ ID NO: 11). 
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Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14187 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5' -ATG AAGATCAATTTC AAGAAGCATGCACTTCTCCTCTTGC-3 ' (SEQ ID NO: 12). 

In order io screen several libraries for a source of a full-length clone, DNA from the libraries was 
5 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 189 gene using the probe oligonucleotide and one of the PCR primers. 

RN A for construction of the cDNA libraries was isolated from human retina tissue (L1B94). The cDN A 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
10 site, linked with blunt to Sail hemikinased adaptors, cleaved with Notl, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; sec, Holmes et al„ Science . 221' 1278-1280 
(1991)) in the unique Xhol and Notl sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
15 PRO 189 and the derived protein sequence for PR0189. 

The entire nucleotide sequence of DNA21624-1391 is shown in Figure 5 (SEQ ID NO:7). Clone 
DNA2 1624- 1391 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 200-202 and ending at the stop codon at nucleotide positions 1301-1303 (Figure 5). The predicted 
polypeptide precursor is 367 amino acids long (Figure 6). The full-length PRO 189 protein shown in Figure 6 
20 has an estimated molecular weight of about 4 1 , 87 1 daltons and a pi of about 5 .06 . Clone DNA2 1 624- 1 39 1 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:8, the putative N-glycosylation sites are at about 
amino acids 224-227, 246-249 and 285-288. A domain for cytosolic fatty-acid binding proteins is at amino acids 
25 78-107 of SEQ ID NO:8. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

Some sequence identity was found to W01 A6. 1 and F35D1 1 . 1 1 , C. Elegans proteins, designated in a 
Dayhoff database as CEW01 A6_10 and CELF35D1 l_l 1 , respectively. Some sequence identity was also found 
to an antigen to malaria and to restin, designated in a Dayhoff database as P_R05766 and AF0140121 , 
30 respectively. Some sequence identity was also found to a microtubule binding protein and to myosin, designated 
in a Dayhoff database as AF041382_1 and S07537, respectively. There is also some sequence identity with 1- 
phosphatidylinositol-4, S-bispbosphate, designated as PIP1RAT. 

EXAMPLE 7 : Isolation of cDNA clones Encoding Human PRO190 
35 A clone designated herein as DNA 14232 was isolated as described in Example 2 above from a human 

fetal retina tissue library. The DNA14232 sequence is shown in Figure 10 (SEQ ID NO: 15). Based on the 
DNA 14232 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
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the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 190. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed 
to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. 
In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
amplification, as per Ausubel et ah, Current Protoco ls in Molecular Biology, with the PCR primer pair. A 
positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -CT AT ACCT ACTGT AGCTTCT-3 ' (SEQ ID NO: 16); and 
reverse PCR primer 5 ' -TCAGAGAATTCCTTCC AGG A-3 ' (SEQ ID NO: 1 7). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14232 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5'-ACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACA-3' (SEQ ID NO: 18). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO190 gene using the probe oligonucleotide and one of the PCR primers. 

RN A for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDN A 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invirrogen. San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with Noil, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; sec. Holmes et al. , §cj£B££, 252- 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave sequences which include the full-length 
DNA sequence for PRO190 [herein designated as DNA23334-1392] (SEQ ID NO: 13) and the derived protein 
sequence for PRO190. 

The entire nucleotide sequence of DNA23334-1392 is shown in Figure 8 (SEQ ID NO: 13). Clone 
DNA23334-1392 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 193-195 and which ends at the stop codon at nucleotide positions 1465-1467 (Figure 8). The predicted 
polypeptide precursor is 424 amino acids long (Figure 9). The full-length PRO190 protein shown in Figure 9 
has an estimated molecular weight of about 48,500 daltons and a pi of about 8.65. Clone DNA23334-1392 has 
been deposited with the ATCC. Regarding the sequence, it is understood thai the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 14 , the putative transmembrane domains are at about 
amino acids 16-36, 50-74, 147-168, 229-250, 271-293 , 298-318 and 328-368 of SEQ ID NO:14. N- 
glycosylation sites are at about amino acids 128-131, 204-207, 218-221 and 274-377 of SEQ ID NO:14. The 
corresponding nucleotides can be routinely determined given the sequences provided herein. 
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PRO 190 has sequence identity with at least the following Dayhoff sequences designated as: 
CEZK896_2, JC5023, GMS1_SCHP0 and S44668. 

EXAMPLE 8 : Isolation of cDNA clones Encoding Human PRQ341 

A clone designated herein as DNA 12920 was isolated as described in Example 2 above from a human 
5 placenta tissue library. The DNA12920 sequence is shown in Figure 13 (SEQ ID NO:2I). The DNA12920 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Altschul et al. , 
Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 

10 some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This 
consensus sequence is herein designated DNA25314. Oligonucleotide primers based upon the DNA25314 
sequence were then synthesized and employed to screen a human placenta cDNA library which resulted in the 
identification of the DNA26288-1239 clone shown in Figure 1 1 . The cloning vector was pRK5B (pRK5B is a 

1 5 precursor of pRK5D that does not contain the Sfil site; see, Holmes et al . , Science . 251: 1 278- 1 280 ( 1 99 1 )), and 
the cDNA size cut was less than 2800 bp. 

A full length clone was identified that contained a single open reading frame with an apparent 
translauonal initiation site at nucleotide positions 380-382, and a stop signal at nucleotide positions 1754-1756 
(Figure 11, SEQ ID NO: 19). The predicted polypeptide precursor is 458 amino acids long, has a calculated 

20 molecular weight of approximately 50,264 daltons and an estimated pi of approximately 8.17. Analysis of the 
full-length PR0341 sequence shown in Figure 12 (SEQ ID NO:20) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 17, transmembrane domains from about amino acid 
171 to about amino acid 190, from about amino acid 220 to about amino acid 239, from about amino acid 259 
to about amino acid 275, from about amino acid 286 to about amino acid 305. from about amino acid 3 16 to 

25 about amino acid 335, from about amino acid 353 to about amino acid 378 and from about amino acid 396 to 
about amino acid 417 and potential N-glycosylation sites from about amino acid 145 to about amino acid 147 
and from about amino acid 155 to about amino acid 158. Clone DNA26288-1239 has been deposited with 
ATCC on April 21, 1998 and is assigned ATCC deposit no. 209792. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:20), evidenced homology 
between the PR0341 amino acid sequence and the following Dayhoff sequences: S75696, H69788, D69852, 
A69888, B64918, F64752, LPU89276J, G64962, S52977 and S44253. 

EXAMPLE 9 : Isolation of cDNA clone s Encoding Human PRO180 
35 A clone designated herein as DNA 12922 was isolated as described in Example 2 above from a human 

placenta tissue library. The DNA12922 sequence is shown in Figure 16 (SEQ ID NO:24). The DNA12922 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
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proprietary EST database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 [Altschul et ah, 
Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 
some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap'' (Phil Green, University of Washington, Seattle, Washington). 

An oligonucleotide probe was formed based upon the consensus sequence obtained above. This probe 
had the following sequence. 

5 ' - ACCTGTTAG AAATGTGGTGGTTTCAGC A AGGCCTC AGTTT (SEQ ID NO:25). 
This probe was used to screen a human placenta library prepared as described in paragraph 1 of Example 2 
above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D thai does not contain the Sfil site; see, 
Holmes et al. f Science . 251:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. A clone 
designated herein as DNA26843-1389 was obtained. 

The enure nucleotide sequence of DNA26843-1389 is shown in Figure 14 (SEQ ID NO:22). Clone 
DNA26843-1389 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 121-123 and ending at the stop codon at nucleotide positions 919-921 (Figure 14). The predicted 
polypeptide precursor is 266 amino acids long (Figure 15). The full-length PRO180 protein shown in Figure 
15 has an estimated molecular weight of about 29,766 daltons and a pi of about 8.39. Clone DNA26843-1389 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO:23, the transmembrane domains are at about 
amino acids 13-33 (type II). 54-73, 94-113, 160-180 and 122-141 of SEQ ID NO:23. N-myristoylation sites 
are at about amino acids 57-62. 95-100. 99-104, 124-129 and 183-188 of SEQ ID NO:23. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 15 (SEQ ID NO:23), evidenced some sequence 
identity between the PRO180 amino acid sequence and the following Dayhoff sequences: CEC33A11J, 
CEG11E6_5, CELW03A5J AND PEU83861_2 (NADH dehydrogenase subunit 4L, mitochondrion). 

EXAMPLE 10: Isolation of cP NA clones Encoding Human PROW 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein DNA 19464. Based on the DNA19464 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 194. PCR 
primers (forward and reverse) were synthesized based upon the DNA19464 sequence. Additionally, a synthetic 
oligonucleotide hybridization probe was constructed from the consensus DNA 19464 sequence. 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 194 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0194 [herein designated as DNA26844-1394J (SEQ ID NO:27) and the derived protein sequence for 
PR0194. 

The entire nucleotide sequence of DNA26844-1394 is shown in Figure 17 (SEQ ID NO:27). Clone 
5 DNA26844- 1 394 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 81-83 and ending at the stop codon at nucleotide positions 873-875 (Figure 17). The predicted 
polypeptide precursor is 264 amino acids long (Figure 18). The full-length PRO 194 protein shown in Figure 
18 has an estimated molecular weight of about 29,665 daltons and a pi of about 9.34. Analysis of the full-length 
PRO 194 sequence shown in Figure 18 (SEQ ID NO: 28) evidences the presence of various important 
10 polypeptides domains as shown in Figure 18. Clone DNA26844-1394 has been deposited with ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209926. 

Analysis of the amino acid sequence of the full-length PRO 194 polypeptide suggests that it does not 
exhibit significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some homology between the PRO 194 amino acid sequence and 
15 the following Dayhoff sequences, HUMORFTJ, CET07F10 5, ATFCA9J2, F64934, YDJXECOLl, 
ATAF00065719F29G20.19, H70002. S76980, H64934 and S76385. 

EXAMPLE U: Isolation of cDNA clones Encoding Human PRO203 

A clone designated herein as DNA 1 56 1 8 was isolated as described in Example 2 above from a human 

20 fetal lung tissue library. The DNA15618 sequence is shown in Figure 21 (SEQ ID NO:31). Oligonucleotide 
probes were generated from the sequence of the DNA 156 18 molecule and were used to screen a human fetal lung 
library (LIB26) prepared as described in paragraph I of Example 2 above. The cloning vector was pRK5B 
(pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al. , Science . 253 : 1278-1280 
(1991)), and the cDNA size cut was less than 2800 bp. 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 159-161 and ending at the stop codon found at nucleotide 
positions 1200-1202 (Figure 19; SEQ ID NO: 29). The predicted polypeptide precursor is 347 amino acids long, 
has a calculated molecular weight of approximately 39,870 daltons and an estimated pi of approximately 6.76. 
Analysis of the full-length PRO203 sequence shown in Figure 20 (SEQ ID NO:30) evidences the presence of 

30 the following: a type D transmembrane domain at about amino acid 64 to about amino acid 87; possible N- 
glycosylauon sites at about amino acid 147 to about amino acid 150, about amino acid 155 to about amino acid 
158, and about amino acid 237 to about amino acid 240; sequence identity with heavy-metal -associated domain 
proteins at about amino acid 23 to about amino acid 45, and sequence identity with D-isomer specific 2- 
hydroxyacid dehydrogenase at about amino acid 24 to about amino acid 34. Clone DNA30862-1396 was 

35 deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209920. 

Analysis of the amino acid sequence of the full-length PRO203 polypeptide suggests that it possesses 
sequence similarity to GST ATPase, thereby indicating that PRO203 may be a novel GST ATPase. More 
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specifically, an analysis of the Dayhoff database (version 35.45 SwissProi 35) evidenced homology between the 
PRO203 amino acid sequence and the following Dayhoff sequences, AF008124_1, CFRCDIGEN1, and 
PR82566. 

EXAMPLE B: Isolation of cDNA clones Encoding Human PRO290 
5 An expressed sequence tag (EST) DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) 

was searched and an EST was identified that had homology to beige and FAN. An oligonucleotide probe based 
upon the identified EST sequence was then synthesized and used to screen human fetal kidney cDNA libraries 
in an attempt to identify a full-length cDNA clone. The oligonucleotide probe had the following sequence: 
5' TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTG 3* (SEQ ID NO:34). 

10 RNA for construction of cDNA libraries was isolated from human fetal kidney tissue. The cDNA 

libraries used to isolate the cDNA clones encoding human PRO290 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oiigo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with Not I, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 

IS pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science 
253:1278-1280 (1991)) in the unique Xhol and NotI. 

A cDNA clone was identified and sequenced in entirety. The entire nucleotide sequence of DNA35680- 
12 12 is shown in Figure 22 (SEQ ID NO: 32). Clone DNA35680- 1212 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 293-295, and a stop codon at nucleotide positions 

20 3302-3304 (Figure 22; SEQ ID NO;32). The predicted polypeptide precursor is 1003 amino acids long. 

It is currently believed that the PRO290 polypeptide is related to FAN and/or beige. Clone DNA35680- 
1212 has been deposited with ATCC and is assigned ATCC deposit no. 209790. It is understood that the 
deposited clone has the actual correct sequence rather than the representations provided herein. The full-length 
PRO290 protein shown in Figure 23 has an estimated molecular weight of about 112,013 daltons and a pi of 

25 about 6.4. 

EXAMPLE 13: Isolation of cDNA Clones Encoding Human PRQ874 

A consensus DNA sequence designated herein as DNA36459 was identified using phrap as described 

in Example 1 above. Based on the DNA36459 consensus sequence, oligonucleotides were synthesized: 1) to 
30 identify by PGR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a 

clone of the coding sequence for PR0874. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' -TCGTGCCC AGGGGCTG ATGTGC-3 ' (SEQ ID NO:37); and 

reverse PCR primer 5'-GTCTTTACCCAGCCCCGGGATGCG-3' (SEQ ID NO:38). 
35 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36459 

sequence which bad the following nucleotide sequence: 



385 



WO 99/63088 



PCT/US99/12252 



hybridization probe 

S'-GGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGG-S' (SEQ ID NO:39). 

In order to screen several libraries for a source of a clone, DNA from the libraries was screened by 
PCR amplification with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0874 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction 
5 of the cDNA libraries was isolated from human fetal lung tissue (L1B25). 

DNA sequencing of the clones isolated as described above gave the DNA sequence for PR0874 [herein 
designated as DNA4062 1-1440] (SEQ ID NO:35) and the derived protein sequence for PR0874. 

The entire nucleotide sequence of DNA40621-1440 is shown in Figure 24 (SEQ ID NO:35). Clone 
DNA4062 1-1440 contains a single open reading frame ending at the stop codon at nucleotide positions 964-966 
10 (Figure 24). The predicted polypeptide encoded by DNA4062 1-1440 is 321 amino acids long (Figure 25). The 
PR0874 protein shown in Figure 25 has an estimated molecular weight of about 36, 194 daltons and a pi of aboui 
9.85. Analysis of the PR0874 sequence shown in Figure 25 (SEQ ID NO: 36) evidenced the presence of the 
following: a type II transmembrane domain at about amino acids 57-80; additional transmembrane domains at 
about amino acids 1 10-126, 215-231, and 254-274; potential N-glycosylation sites at about amino acids 16-19, 
15 27-30, and 289-292; sequence identity with hypothetical YBR002c family proteins at about amino acids 276-287; 
and sequence identity with ammonium transporter proteins at about amino acids 204-230. Clone DNA40621- 
1440 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209922. 

Analysis of the amino acid sequence of the PR0874 polypeptide suggests that it is a novel multi-span 
transmembrane protein. However, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
20 sequence identity between the PR0874 amino acid sequence and the following Dayhoff sequences: S67049, 
AF054839J, S73437, S52460, and HIVU80570J. 

EXAMPLE 14: Isolation of cDNA Clones Encoding Human PRQ7 10 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 
25 Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 

designated as DNA38I90) is shown in Figure 28 (SEQ ID NO:42). Based on the DNA38190 sequence shown 

in Figure 28, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO7I0. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
30 amplification, as per Ausubel et al., Current Protocols in Molecular Biolo«v. with the PCR primer pair. A 

positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-TTCCGCAAAGAGTTCTACGAGGTGG-3' (SEQ ID NO:43) 
35 reverse PCR primer 5'-ATTGACAACATTGACTGGCCTATGGG-3' (SEQ ID NO:44) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA38I90 sequence 

which had the following nucleotide sequence 
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hybridization probe 

5 ' -GTGGATGCTCTGTGTGCGTGCAAG ATCCTTCAGGCCTTGTTCCAGTGTGA-3 ' (SEQ ID NO:45) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amp ;ification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO710 gene using the probe oligonucleotide and one of the PCR primers. 
5 RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with Nod, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 

10 pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al. ( Science . 
253 : 1278-1280 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 67-69 and ending at the stop codon found at nucleotide positions 
1765-1767 (Figure 26, SEQ ID NO:40). The predicted polypeptide precursor is 566 amino acids long, has a 

15 calculated molecular weight of approximately 65,555 daltons and an estimated pi of approximately 5.44. 
Analysis of the full-length PRO710 sequence shown in Figure 27 (SEQ ID NO:41) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 32, a transmembrane domain from 
about amino acid 454 to about amino acid 476, an aminoacyl-transfer RNA synthetase class-II signature sequence 
from about amino acid 6 to about amino acid 26 and potential N-glycosylation sites from about amino acid 1 1 1 

20 to about amino acid 114, from about amino acid 146 to about amino acid 149 and from about amino acid 292 
to about amino acid 295. Clone DNA44161-1434 has been deposited with ATCC on May 27, 1998 and is 
assigned ATCC deposit no. 209907. 

Analysis of the amino acid sequence of the full-length PRO710 polypeptide suggests that it possesses 
significant sequence similarity to the CDC45 protein, thereby indicating that PR07 10 may be a novel CDC45 

25 homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO710 amino acid sequence and the following Dayhoff sequences, 
HSAJ3728_1. CEF34D10J, S64939, UMU50276J, TRHY SHEEP, CELT14E8J, RNA1_YEAST, 
LVU89340 1, HSU80736J and CEZK337_2. 

30 EXAMPLE 15: Isolation of cDNA clones Encoding Human PRO 1151 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
m Example 1 above. This consensus sequence is herein designated DNA40665. Based on the DNA40665 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 

35 PROU51. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 '-CCAGACGCTGCTCTTCG A A AGGGTC- V (SEQ ID NO:48) 
reverse PCR primer S'-GGTCCCCGTAGGCCAGGTCCAGM' (SEQ ID NO:49) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40665 
sequence which had the following nucleotide sequence 
5 hYMflHMipn P^be 

5-CTACTTCTTCAGCCTCAATGTGCACAGCTGGAATTACAAGGAGACGTACG-3' (SEQ ID NO:50) 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1 15 1 gene using the probe oligonucleotide and one of the PCR primers. RN A 
10 for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 151 (designated herein as DNA44694-1500 [Figure 29, SEQ ID NO:461 ; and the derived protein sequence 

for PR01151. 

The enure nucleotide sequence of DNA44694-1500 is shown in Figure 29 (SEQ ID NO:46). Clone 
15 DNA44694-1500 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 272-274 and ending at the stop codon at nucleotide positions 1049-1051 (Figure 29). The predicted 
polypeptide precursor is 259 amino acids long (Figure 30). The full-length PROl 151 protein shown in Figure 
30 has an estimated molecular weight of about 28,770 daltons and a pi of about 6. 12. Analysis of the full-length 
PROl 151 sequence shown in Figure 30 (SEQ ID NO:47) evidences the presence of the following: a signal 
20 peptide from about amino acid 1 to about amino acid 20, a potential N-glycosylation site from about amino acid 
72 to about amino acid 75 and amino acid sequence blocks having homology to Clq domain-containing proteins 
from about amino acid 144 to about amino acid 178, from about amino acid 78 to about amino acid 1 1 1 and from 
about amino acid 84 to about amino acid 117. Clone UNQ581 (DNA44694-1500) has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203114. 
25 An analysis of the Dayhoff database (version 35.45 SwissProt 35)> using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:47), evidenced significant 
homology between the PROl 151 amino acid sequence and the following Dayhoff sequences: ACR3_HUMAN, 
HP25_TAMAS, HUMC1QB2J, P_R99306, CA1F_HUMAN, JX0369, CA24JIUMAN, S32436, P_R289I6 
and CA54_HUMAN. 

30 

EXAMPLE \$: Isolation of cDNA clones Encoding Human PRQ1282 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA33778. Based on theDNA33778 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1282. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 'TCTTC AGCCGCTTGCGC AACCTC3 ' (SEQ ID NO:53); and 
reverse PCR primer 5'TTGCTCACATCCAGCTCCTGCAGG3' (SEQ ID NO:54). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DN A3 3778 sequence which bad the following nucleotide sequence: 
5 hybridization probe 

5'TGGATGTTGTCCAGACAACCAGCTGGAGCTGTATCCGAGGC3 , (SEQ ID NO:55). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1282 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO 1 282 (designated herein as DNA45495- 1 550 [Figure 3 1 , SEQ ID NO:5 1 ] ; and the derived protein sequence 
for PRO 1 282. 

The entire coding sequence of PRO 1282 is shown in Figure 3 1 (SEQ ID NO:51). Clone DNA45495- 
15 1550 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 
120-122, and an apparent stop codon at nucleotide positions 2139-2141 (SEQ ID NO:51). The predicted 
polypeptide precursor is 673 amino acids long. The signal peptide is at about amino acids 1-23; the 
transmembrane domain is at about amino acids 579-599; an EGF-like domain cysteine pattern signature starts 
at about amino acid 430; and leucine zipper patterns start at about amino acids 197 and 269 of SEQ ID NO: 52, 
20 see Figure 32. Clone DNA45495-1550 has been deposited with the ATCC and is assigned ATCC deposit no. 
203156. The full-length PRO 1282 protein shown in Figure 32 has an estimated molecular weight of about 
71,655 daltons and a pi of about 7.8. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 32 (SEQ ID NO:52), revealed sequence identity 
25 between the PRO 1282 amino acid sequence and the following Dayhoff sequences (data from database 
incorporated by reference): AB007876J, RNPLGPVJ, MUSLRRPJ, ALS_PAPPA, AC004142J, 
ALSHUMAN, ABO 14462 J, DMT ART AN J, HSCHON03 1 and S46224. 

EXAMPLE 17: Isolation of cDNA clones Encoding Human PRQ358 
30 Using the method described in Example 1 above, a single EST sequence was identified in the lncyte 

database, designated herein as INC3 115949. Based on the INC31 15949 EST sequence, oligonucleotides were 

synthesized to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to 

isolate a clone of the full-length coding sequence for PR0358. 

A pair of PCR primers (forward and reverse) were synthesized: 
35 forward PCR primer 5' -TCCCACC AGGTATCATAAACTGAA-3' (SEQ ID NO:58) 

reverse PCR primer 5'-TTATAGACAATCTGTTCTCATCAGAGA-3' (SEQ ID NO:59) 
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A probe was also synthesized: 

5 * - A A AA AGC AT ACTTGG A ATGGCCC AAGG ATAGGTGT AA ATG- 3 * (SEQ ID NO:60) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then -lsed io 
isolate clones encoding the PR0358 gene using the probe oligonucleotide and one of the PCR primers. RNA 
5 for construction of the cDNA libraries was isolated from human bone marrow (LIB256). The cDNA libraries 
used to isolated the cDNA clones were constructed by standard methods using commercially available reagents 
such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, 
linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor 
10 of pRKSD that does not contain the Sfil site; see, Holmes et ah, Science . 253 : 1278-1280 (1991)) in the unique 
Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0358 (Figure 33, SEQ ID NO:56) and the derived protein sequence for PR0358 (Figures 34, SEQ ID 
NO;57). 

15 The entire nucleotide sequence of the clone identified (DNA47361- 1 154) is shown in Figure 33 (SEQ 

ID NO:56). Clone DNA47361-1154 contains a single open reading frame with an apparent translational 
initiation site (ATG start signal) at nucleotide positions underlined in Figure 33. The predicted polypeptide 
precursor is 811 amino acids long, including a putative signal sequence (amino acids 1 to 19), an extracellular 
domain (amino acids 20 to 575, including leucine rich repeats in the region from position 55 to position 575), 

20 a putative transmembrane domain (amino acids 576 to 595). Clone DNA4736I - 1249 has been deposited with 
ATCC and is assigned ATCC deposit no. 209431. 

EXAMPLE 18: Isolation of cD NA clones Encoding Human PRO 13 10 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
25 in Example 1 above. This consensus sequence is designated herein as DNA37164. Based on the DNA37164 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1310. 

PCR primers (forward and reverse) were synthesized: 
30 forward PCR primer: 5 'GTTCTC AATGAGCTACCCGTCCCC3 * (SEQ ID NO:63) and 
reverse PCR DrimeriS'CGCGATGTAOTCTGAACTCGGTTrT^' (SEQ ID NO:64). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA47394 sequence which had the following nucleotide sequence: 

35 5'ATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCTGCATGAG3' (SEQ ID NO:65). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
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isolate clones encoding the PRO 13 10 gene using the probe oligonucleotide and one of the PC R primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1310 sad the derived protein sequence for PRO1310. 

The enure coding sequence of PRO1310 is shown in Figures 35A-B (SEQ ID NO:61). Clone 
5 DNA47394-1572 contains a single open reading frame with an apparent translauonal initiation site at nucleotide 
positions 326-328, and an apparent stop codon at nucleotide positions 2594-2596 (SEQ ID NO:61). The 
predicted polypeptide precursor is 765 amino acids long. The signal peptide is at about amino acids 1 -25 of SEQ 
ID NO:62. Clone DNA47394-1572 has been deposited with ATCC and is assigned ATCC deposit no. 203109. 
The full-length PRO 13 10 protein shown in Figure 36 has an estimated molecular weight of about 85,898 daltons 
10 and a pi of about 6.87. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO: 62), revealed sequence identity 
between the PRO1310 amino acid sequence and the following Dayhoff sequences: AF017639J, PW36817, 
JC5256, CBPHHUMAN, MMU23184J, CBPN_HUMAN, HSU8341M. CEF01D4_7, RNU62897J and 
15 P_W11851. 

EXAMPLE 19 : Isolation of cDNA Clones Encoding Human PRQ698 

A yeast screening assay was employed to identify cDN A clones that encoded potential secreted proteins. 

Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
20 designated as DNA39906) is shown in Figure 39 (SEQ ID NO:68). Based on the DNA39906 sequence shown 

in Figure 39, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0698. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 

amplification, as per Ausubel et ah. Current Protocols in Molecular Biology, with the PCR primer pair. A 
25 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' - AGCTGTGGTC ATGGTGGTGTGGTG-3 ' (SEQ ID NO:69) 

reverse PCR primer 5 ' -CTACCTTGGCC AT AGGTG ATCCGC-3 ' (SEQ ID NO:70) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39906 

sequence which had the following nucleotide sequence 

hybridization probe 

5'-CATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGAGAGGGTT-3' (SEQ ID NO:71) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0698 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human bone marrow tissue (UB255). The cDNA 
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libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogcn, San Diego, CA. The cDNA was primed with oligo dT containing a Noil 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et a!., Science . 253: 1278-1280 
5 (1991)) in the unique Xhol and NotI sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 1 4 - 1 6 and ending at the stop codon found at nucleotide positions 
1544-1546 (Figure 37, SEQ ID NO: 66). The predicted polypeptide precursor is 510 amino acids long, has a 
calculated molecular weight of approximately 57,280 daltons and an estimated pi of approximately 5.61. 

10 AnaJysis of the full-length PR0698 sequence shown in Figure 38 (SEQ ID NO:67) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 20, potential N-glycosylation sites 
from about amino acid 72 to about amino acid 75, from about amino acid 136 to about amino acid 139, from 
about amino acid 193 to about amino acid 196, from about amino acid 253 to about amino acid 256, from about 
amino acid 352 to about amino acid 355 and from about amino acid 41 1 to about amino acid 4 14 an amino acid 

15 block having homology to legume lectin beta-chain proteins from about amino acid 20 to about amino acid 39 
and an amino acid block having homology to the HBGF/FGF family of proteins from about amino acid 338 to 
about amino acid 365. Clone DNA48320-1433 has been deposited with ATCC on May 27, 1998 and is assigned 
ATCC deposit no. 209904. 

Analysis of the amino acid sequence of the full-length PR0698 polypeptide suggests that it possesses 

20 significant sequence similarity to the olfactomedin protein, thereby indicating that PR0698 may be a novel 
olfactoraedin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0698 amino acid sequence and the following Dayhoff sequences, 
OLFMRANCA, 173637, AR006686S3J , RNU78105J, RNU72487J, PR98225, CELC48E7_4, 
CEF11C3J, XLU85970J and S42257. 

25 

EXAMPLE 2Q: Isolation of cDNA Clones Encoding Human PRQ732 

A yeast screening assay was employed to identify cDN A clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA42580) is shown in Figure 45 (SEQ ID NO:77). The DNA42580 sequence was then 

30 compared to a variety of known EST sequences to identify homologies. The EST databases employed included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
(Altshul etal., Methods in Enzvmoiogv 266:460-480 (19%)) as a comparison to a 6 frame translation of the EST 
sequence. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 

35 encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). 
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Using the above analysis, a consensus DNA sequence was assembled relative to other EST sequences 
using phrap. This consensus sequence is herein designated consenOl. Proprietary Genentech EST sequences 
were employed in the consensus assembly and they are herein designated DNA20239 (Figure 42; SEQ ID 
NO:74), DNA38050 (Figure 43; SEQ ID NO:75) and DNA40683 (Figure 44; SEQ ID NO:76). 

Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
5 library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0732. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 
often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 
bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is greater 
than about l-1.5kbp. In order to screen several libraries for a full-length clone, DNA from the libraries was 

10 screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 
oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR Primer 5 ' - ATGTTTGTGTGGAAGTGCCCCG-3 ' (SEQ ID NO:78) 

15 forward PCR primer 5'-GTCAACATGCTCCTCTGC-3' (SEQ ID NO:79) 

reverse PCR primer 5 '-AATCCATTGTGCACTGCAGCTCTAGG-3 1 (SEQ ID NO:80) 

reverse PCR primer 5 ' -G AGC ATGCC ACC ACTGG ACTG AC - 3 ' (SEQ ID NO:81) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44143 

sequence which had the following nucleotide sequence 

20 hybridization probe 

5'-GCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCAC-3' (SEQ ID 
NO: 82) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

25 to isolate clones encoding the PR0732 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB26). The 
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a Notl site, linked with blunt to Sail hemilcinased adaptors, cleaved with NotI, sized appropriately 

30 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science . 
253:1278-1280 (1991)) in the unique Xhol and Notl sites. 

A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 88-90 and ending at the stop codon found at nucleotide positions 

35 1447-1449 (Figure 40, SEQ ID NO: 72). The predicted polypeptide precursor is 453 amino acids long, has a 
calculated molecular weight of approximately 50,419 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0732 sequence shown in Figure 41 (SEQ ID NO: 73) evidences the presence of 
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the following: a signal peptide from about amino acid 1 to about amino acid 28, transmembrane domains from 
about amino acid 37 to about amino acid 57, from about amino acid 93 to about amino acid 109, from about 
amino acid 126 to about amino acid 148, from about amino acid 151 to about amino acid 172, from about amino 
acid 197 to about amino acid 215, from about amino acid 231 to about amino acid 245, from about amino acid 
260 to about amino acid 279, from about amino acid 315 to about amino acid 333, from about amino acid 384 
5 to about amino acid 403 and from about amino acid 422 to about amino acid 447 , potential N-glycosylation sites 
from about amino acid 33 to about amino acid 36, from about amino acid 34 to about amino acid 37, from about 
amino acid 179 to about amino acid 183, from about amino acid 298 to about amino acid 301 , from about amino 
acid 337 to about amino acid 340 and from about amino acid 406 to about amino acid 409, an amino acid block 
having homology to the M1P family of proteins from about amino acid 1 19 to about amino acid 149 and an 

1 0 amino acid block having homology to DNA/RN A non-specific endonuclease proteins from about amino acid 279 
to about amino acid 286. Clone DNA48334-I435 has been deposited with ATCC on June 2, 1998 and is 
assigned ATCC deposit no. 209924. 

Analysis of the amino acid sequence of the full-length PR0732 polypeptide suggests that it possesses 
significant sequence similarity to the Diff33 protein, thereby indicating that PR0732 may be a novel Diff33 

15 homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0732 amino acid sequence and the following Dayhoff sequences, 
HS179M20_2, MUSTETUJ, CER11H6J2, RATDRP1, S51256. E69226, AE000869J. JC4120, 
CYBPARTE and PJR50619. 

20 EXAMPLE 21 : Isolation of cDNA clones Encoding Human PR0 11 20 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is designated herein consen0352. The consen0352 sequence was 

then extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible 

using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as 
25 DNA34365. Based on the DNA34365 consensus sequence, oligonucleotides were synthesized: 1) to identify 

by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the 

full-length coding sequence for PRO U 20. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primers : 5 ' -G AAGCCGGCTGTCTG AATC -3 ' (SEQ ID NO:85), 
30 5'-GGCCAGCTATCTCCGCAG-3' (SEQ ID NO: 86), 5'-AAGGGCCTGCAAGAGAAG-3' (SEQ1DN0:87), 

5 * -C ACTGGG AC AACTGTGGG-3 * (SEQ ID NO:88), 

5 * -C AG AGGC A ACGTGG AG AG-3 ' (SEQ ID NO:89), and 

5*-AAGTATTGTCATAC AGTGTTC-3 ' (SEQ ID NO:90); 

reverse PCR primers : 5 -TAGTACTTGGGCACGAGGTTGGAG-3' (SEQ ID NO:91), and 5 1 - 
35 TCATACCAACTGCTGGTCATTGGC-3' (SEQ ID NO:92). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA34365 
consensus sequence which had the following nucleotide sequence: 
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hybridization probe: 

5 ' -CTC AAGCTGCTGG AC ACGG AGCGGCCGGTG AATCGGTTTC ACTTG-3 ' (SEQ ID NO:93). 

In order lo screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PROl 120 gene using the probe oligonucleotide and one of the PCR primers. RNA 
5 for construction of the cONA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 120 (designated herein as DNA48606-1479 [Figures 46A-B, SEQ ID NO:83]; and the derived protein 
sequence for PRO1120. 

The entire coding sequence of PRO1120 is shown in Figures 46A-B (SEQ ID NO:83). Clone 
10 DNA48606-1479 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 608-610 and an apparent stop codon at nucleotide positions 3209-321 1 . The predicted polypeptide 
precursor is 867 amino acids long. The full-length PRO 11 20 protein shown in Figure 47 has an estimated 
molecular weight of about 100,156 Daltons and a pi of about 9.44. Additional features of the PROl 120 
polypeptide include a signal peptide at about amino acids 1-17; a sulfatase signature at about amino acids 86-98; 
15 regions of homology to sulfatases at about amino acids 87-106, 133-146, 216-229, 291-320, and 365-375; and 
potential N-glycosylation sites at about amino acids 65-68, 112-115, 132-135, 149-152, 171-174, 198-201,241- 
245, 561-564, 608-611, 717-720, 754-757, and 764-767. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 47 (SEQ ID NO:84), revealed significant 
20 homology between the PRO1120 amino acid sequence and the following Dayhoff sequences: CELK09C41, 
GL6S HUMAN, G65 1 69, NCU89492 1 , BCU44852_ 1 , E64903 , P R5 1 355, STSHUM AN, G A6SHUM AN , 
and IDS_MOUSE. Clone DNA48606-1479 was deposited with the ATCC on July 1, 1998, and is assigned 
ATCC deposit no. 203040. 

25 EXAMPLE 22: Isolation of cDN A clones Encoding Human PRQ537 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated as Incyte EST cluster no. 29605. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 

30 CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48350. 

35 In light of an observed sequence homology between the DNA48350 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. R63443, the Merck EST clone R63443 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 48 and is herein designated as DNA49I41-1431. 

Clone DNA4914 1-1431 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 97-99 and ending at the stop codon at nucleotide positions 442-444 (Figure 48). The 
predicted polypeptide precursor is 115 amino acids long (Figure 49). The full-length PR0537 protein shown 
in Figure 49 has an estimated molecular weight of about 13, 183 daltons and a pi of about 12. 13. Analysis of 
5 the full-length PR0537 sequence shown in Figure 49 (SEQ ID NO 95) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 3 1, a potential N-giycosylation site from about 
amino acid 44 to about amino acid 47, potential N-myristolation sites from about amino acid 3 to about amino 
acid S and from about amino acid 16 to about amino acid 21 and an amino acid block having homology to 
multicopper oxidase proteins from about amino acid 97 to about amino acid 105. Clone DNA49 14 1-1431 has 

10 been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203003. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 49 (SEQ ID NO:95), evidenced homology 
between the PR0537 amino acid sequence and the following Dayhoff sequences: A54523, CELF22H10_2, 
FKH4MOUSE, OTX 1_HUMAN, URB 1 USTMA, KNOBPLAFN , A32895_ I , AF036332_ 1 , HRGHUMAN 

15 and HRP3_PLAFS. 

EXAMPLE 23 : Isolation of cDNA clones Encoding Human PRQ536 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu2437.init. This EST cluster sequence was 

20 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in EnzvmoloKV 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

25 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48351. 

In light of an observed sequence homology between the DNA4835 1 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H 1 1 129, the Merck EST clone H 11 129 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 50 and is herein designated as DNA49 142- 1430. 

Clone DNA49142-1430 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 48-50 and ending at the stop codon at nucleotide positions 987-989 (Figure 50). The 
predicted polypeptide precursor is 313 amino acids long (Figure 51). The full-length PR0536 protein shown 
in Figure 51 has an estimated molecular weight of about 34, 189 daltons and a pi of about 4.8. Analysis of the 

35 full-length PR0536 sequence shown in Figure 51 (SEQ ID NO:97) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a potential N-glycosylation site from about amino 
acid 45 to about amino acid 48 and an amino acid sequence block having homology to sulfatase proteins from 
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about amino acid 16 to about amino acid 26. Clone DNA49 142- 1430 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203002. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 51 (SEQ ID NO: 97), evidenced homology 
between the PR0536 amino acid sequence and the following Dayhoff sequences: APU46857J, PK2JMCDI, 
5 H64743, F5I14J8, CEAM_ECOLI, GEN14267, H64965, TCU39815J, PSBJ_ODOSI and P_R06980. 

EXAMPLE 24: of qPNA clones Encoding Human PRQ535 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss xlu 12694. init. This EST cluster sequence was 

10 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

15 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA48352. Two 
propietary Genentech EST sequences were employed in the assembly are are herein shown in Figures 54 and 
55. 

In light of an observed sequence homology between the DNA48352 consensus sequence and an EST 
20 sequence encompassed within the Merck EST clone no. H 86994, the Merck EST clone H86994 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 52 and is herein designated as DNA49143-1429. 

Clone DNA49 143- 1429 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 78-80 and ending at the stop codon at nucleotide positions 681-683 (Figure 52). The 
25 predicted polypeptide precursor is 201 amino acids long (Figure 53). The full-length PR0535 protein shown 
in Figure 53 has an estimated molecular weight of about 22, 1 80 daltons and a pi of about 9.68. Analysis of the 
full-length PR0535 sequence shown in Figure 53 (SEQ ID NO:99) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a transmembrane domain from about amino acid 
155 to about amino acid 174, a potential N-glycosylation site from about amino acid 196 to about amino acid 
30 199 and FKBP-rype peptidyl-prolyl cis-trans isomer signature sequences from about amino acid 62 to about 
amino acid 77, from about amino acid 87 to about amino acid 1 23 and from about amino acid 128 to about amino 
acid 141. Clone DNA49143-1429 has been deposited with ATCC on June 23, 1998 and is assigned ATCC 
deposit no. 203013. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST- sequence 
35 alignment analysis of the full-length sequence shown in Figure 53 (SEQ ID NO: 99), evidenced homology 
between the PR0535 amino acid sequence and the following Dayhoff sequences : S7 1 237 , P_R9355 1 , P R28980, 
S71238, FKB2_HUMAN, CELC05C8J, S55383, S72485, CELC50F2 6 and S75144. 
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EXAMPLE 25 : Isolation of cDNA clones Enco fljn F Human PRQ718 

A cDNA sequence isolated in the amylase screen described in Example 2 (human fetal lung library) 
above is herein designated DNA43512 (see Figure 62; SEQ ID NO: 108). The DNA43512 sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to 
5 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA45625 . Proprietary 

10 Genentech EST sequences were employed in the assembly and are herein shown in Figures 58-61 . 

Based on the DNA45625 sequence, oligonucleotide probes were generated and used to screen a human 
fetal lung library (LIB25) prepared as described in paragraph 1 of Example 2 above. The cloning vector was 
pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 
253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

15 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5' -GGGTGGATGGTACTGCTGCATCC-3 ' (SEQ ID NO: 109) 
reverse PCR primer 5 ' -TGTTGTGCTGTGGGAA ATC AG ATGTG-3 ' (SEQ ID NO: 1 10) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45625 sequence 
which had the following nucleotide sequence: 

20 hybridization probe 

5 -GTGTCTGGAGGCTGTGGCCGTTTTG'n i TCMGGGCTAAAATCGGG-3' (SEQ ID NO: 111) 

In order to screen several libraries for a source of a full -length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0718 gene using the probe oligonucleotide and one of the PCR primers, 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 36-38 and ending at the slop codon found at nucleotide positions 
607-609 (Figure 56; SEQ ID NO: 102). The predicted polypeptide precursor is 157 amino acids long, has a 
calculated molecular weight of approximately 17,400 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0718 sequence shown in Figure 57 (SEQ ID NO: 103) evidences the presence of 

30 the following: a type II transmembrane domain from about amino acid 21 to about amino acid 40, and other 
trans membrane domains at about amino acid 58 to about amino acid 78, about amino acid 95 to about amino acid 
114, and about amino acid 127 to about amino acid 147; a cell attachment sequence from about amino acid 79 
to about amino acid 81; and a potential N-glycosylauon site from about amino acid 53 to about ammo acid 56. 
Clone DNA49647-1398 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 

35 209919. 

Analysis of the amino acid sequence of the full-length PR0718 polypeptide suggests that it possesses 
no significant sequence similarity to any known protein. However, an analysis of the Dayhoff database (version 
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35.45 SwissProt 35) evidenced some degree of homology between the PR0718 amino acid sequence and the - 
following Dayhoff sequences: AF045606J, AF039906J, SPBC8D2_2, S63441, F64728, COX1TRYBB, 
F64375, E64173, RPYGJT3, MTCY261J3. 

EXAMPLE 26 : Isolation of cDNA clones Enco ding Human PRQ872 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence designated herein as clu 120709. init. The clul 20709. init sequence was then compared a 

proprietary EST DNA database (UFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

et al., Methods in Enzvmology 266 :460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

DNA sequence with the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The 

consensus sequence obtained therefrom is herein designated DNA48254. 

In light of an observed sequence homology between the DNA48254 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3438068, the Incyte EST clone 3438068 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 63 and is the full-length DNA sequence for PR0872. 

Clone DNA49819-1439 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 

209931. 

The entire nucleotide sequence of DNA49819-1439 is shown in Figure 63 (SEQ ID NO:l 12). Clone 
20 DNA49819-1439 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 14-16 and ending at the stop codon at nucleotide positions 1844-1846 (Figure 63). The predicted 
polypeptide precursor is 610 amino acids long (Figure 64). The full-length PR0872 protein shown in Figure 
64 has an estimated molecular weight of about 66,820 daltons and a pi of about 8.65. Analysis of the full-length 
PR0872 sequence shown in Figure 64 (SEQ ID NO: 113) evidences the presence of the following features: a 
25 signal peptide at amino acid 1 to about 18, putative transmembrane domains at about amino acids 70-87, 200-222 
and 568-588; sequence identity with bacterial-type phytoene dehydrogenase protein at about amino acids 71-105; 
sequence identity with a regulator of chromosome condensation (RCC1) signature 2 at about amino acids 201- 
211; leucine zipper patterns at about amino acids 214-235, 221-242, 228-249 and 364-385; a potential N- 
glycosylation site at about amino acids 271-274; and a glycosaminoglycan attachment site at about amino acids 
30 75-78. Analysis of the amino acid sequence of the full-length PR0872 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PR0872 amino acid sequence and the following 
Dayhoff sequences: PRCRTIJ, S75951, S74689, CELF37C4 3, CRTI_RHOCA, S76617, YNI2METTL, 
MTV014J4, AOFBHUMAN, and MMU70429 1. 

35 EXAMPLE 27 : Isolation of cDNA clones Encoding Human PRO1063 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as ss. clu 11 9743. init. The Incyte EST cluster sequence 



399 



WO 99/63088 



PCT/US99/12252 



ss.clul 19743. init sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 ( AJtshul et al . , Methods ir Enzymology 266:460^480 ( 1 996)) . Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
5 proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA48288. 

In light of an observed sequence homology between the DNA48288 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2783726, the Incyte EST clone 2783726 was purchased 

10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 65 and is herein designated DNA49820-1427. 

The full length clone shown in Figure 65 contained a single open reading frame with an apparent 
transiational initiation site at nucleotide positions 90-92 and ending at the stop codon found at nucleotide positions 
993-995 (Figure 65; SEQ ID NO: 1 14). The predicted polypeptide precursor is 301 amino acids long, has a 

15 calculated molecular weight of approximately 33,530 daltons and an estimated pi of approximately 4.80. 
Analysis of the full-length PRO 1063 sequence shown in Figure 66 (SEQ ID NO: 1 15) evidences the presence of 
the following: a signal peptide from about amino acid I to about amino acid 21, potential N-glycosylation sites 
from about amino acid 195 to about amino acid 198, from about amino acid 217 to about amino acid 220 and 
from about amino acid 272 to about amino acid 275 , a glycosaminoglycan attachment site from about amino acid 

20 267 to about amino acid 270, a microbodies C-tenninal targeting signal site from about amino acid 299 to about 
amino acid 301, a type II fibronectin collagen-binding domain homology sequence from about amino acid 127 
to about amino acid 168 and a fructose-bisphosphate aldolase class II protein homology sequence from about 
amino acid 101 to about amino acid 1 18. Clone DNA49820- 1427 has been deposited with the ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209932. 

25 Analysis of the amino acid sequence of the full-length PRO 1063 polypeptide suggests that it possesses 

sequence similarity to the human type IV collagenase protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO 1063 amino acid 
sequence and the following Dayhoff sequences, S68303, CFU68533J, P_P91139, RNU65656J, 
PA2R RABIT, MMU56734J, FINC_XENLA, A48925, P_R92778 and FA12_HUMAN. 

30 

EXAMPLE 28: Isolation of cDNA clones Encoding Human PRQ619 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 88434. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
35 GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in EnzvmoloRV 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
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of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1656694, the Incyte EST clone 1656694 was purchased and the 

5 cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 67 and is herein designated as DNA4982 1-1562. 

The full length clone shown in Figure 67 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 81-83 and ending at the stop codon found at nucleotide positions 
450-452 (Figure 67; SEQ ID NO: 116). The predicted polypeptide precursor (Figure 68, SEQ ID NO: 117) is 

10 123 amino acids long including a predicted signal peptide at about amino acids 1-20. PR0619 has a calculated 
molecular weight of approximately 13,710 daltons and an estimated pi of approximately 5.19. Clone 
DNA49821-1562 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit no. 209981. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 68 (SEQ ID NO:l 17), revealed significant 

1 5 homology between the PR06 19 amino acid sequence and the following Dayhoff sequences: S35302, D87009_ 1 , 
HSU93494J, HUMIGLAM51, D86999_2, HUMIGLYMll, HUMIGLYMKEJ, A29491J. A29498J, 
and VPR2_MOUSE. 

EXAMPLE 29: Isolation of cDNA clones Encoding Human PRQ943 

20 A consensus DNA sequence encoding PR0943 was assembled relative to other EST sequences using 

phrap as described in Example 1 above. This consensus sequence was then extended using repeated cycles of 
BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences 
discussed above. The extended consensus sequence is herein designated DN A36360. Based on the DNA36360 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 

25 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0943. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CGAGATGACGCCGAGCCCCC-3' (SEQ ID NO: 120) 
reverse PCR primer 5*-CGGTTCGACACGCGGCAGGTG-3' (SEQ ID NO: 121) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36360 
sequence which had the following nucleotide sequence 

5 , -TGCrGCTCCTGCTGCCGCCGCTGCTGCTGGGGGCCTTCCCGCCGG-3' (SEQ ID NO: 122) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0943 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0943 (designated herein as DNA52192- 1369 [Figure 69, SEQ ID NO : 1 1 8]) and the derived protein sequence 
for PR0943. 

The entire nucleotide sequence of DNA52192-1369 is shown in Figure 69 (SEQ ID NO: 1 18). Clone 
DNA52 192- 1369 contains a single open reading frame with an apparent translational initiation site at nucleotide 
5 positions 150-152 and ending at the stop codon at nucleotide positions 1662-1664 (Figure 69). The predicted 
polypeptide precursor is 504 amino acids long (Figure 70). The full-length PR0943 protein shown in Figure 
70 has an estimated molecular weight of about 54,537 daltons and a pi of about 10.04. Analysis of the full- 
length PR0943 sequence shown in Figure 70 (SEQ ID NO: 1 19) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 376 

10 to about amino acid 396, tyrosine kinase phosphorylation sites from about amino acid 212 to about amino acid 
219 and from about amino acid 329 to about amino acid 336, potential N-glycosylation sites from about amino 
acid 111 to about amino acid 114, from about amino acid 231 to about amino acid 234, from about amino acid 
255 to about amino acid 258 and from about amino acid 293 to about amino acid 296 and an immunoglobulin 
and MHC protein sequence homology block from about amino acid 219 to about amino acid 236. Clone 

15 DNA52192-1369 has been deposited with ATCC on July 1. 1998 and is assigned ATCC deposit no. 203042. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 119), evidenced significant 
homology between the PR0943 amino acid sequence and the following Dayhoff sequences: B49151, A39752, 
FGR1JCENLA, S38579, RATHBFGFRBl, TVHU2F, FGR2MOUSE, CEK3CHICK, P_R21080 and 

20 A27171J. 

EXAMPLE 30: Isolation of cDNA clones Encoding Human PROl 188 

A consensus DNA sequence was assembled relative to other EST sequences using the program M phrap n 
as described in Example 1 above. This consensus sequence is designated herein as DNA45679. Based on the 
25 DNA45679 consensus sequence, oligonucleotides were synthesized: 1 ) to identify by PCR a cDN A library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
for PROl 188. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -CTGGTGCCTC AACAGGG AGC AG-3 ' (SEQ ID NO: 125) 
30 reverse PCR primer 5'-CCATTGTGCAGGTCAGGTCACAG-3* (SEQ ID NO:126) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45679 sequence which had the following nucleotide sequence: 
hybridization probe 

5 '-CTGGAGCAAGTGCTCAGCTGCCTGTGGTCAGACTGGGGTC-3 ' (SEQ ID NO: 127) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 188 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROU88 (designated herein as DNA52598-1518 [Figure 71, SEQ ID NO: 123]); and the derived protein 
sequence fo, PROl 188. 

The entire coding sequence of PROl 188 is shown in Figure 71 (SEQ ID NO: 123). Clone DNA52598- 
5 1518 contains a single open reading frame with an apparent translauonal initiation site at nucleotide positions 
136-138 and an apparent stop codon at nucleotide positions 3688-3690. The predicted polypeptide precursor 
is 1 184 amino acids long. The full-length PROl 188 protein shown in Figure 72 has an estimated molecular 
weight of about 132,582 Daltons and a pi of about 8.80. Additional features include: a signal peptide at about 
amino acids 1-31; an ATP/GTP binding site motif A (P-loop) at about amino acids 266-273; an aldehyde 
10 dehydrogenases cysteine active site at about amino acids 188-199; growth factor and cytokines receptors family 
signature 2 at about amino acids 153-159; and potential N-glycosylation sites at about amino acids 129-132, 132- 
135, 346-349, 420-423, 550-553, 631-634, 1000-1003, and 1056-1059. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 72 (SEQ ID NO: 124), revealed significant 
15 homology between the PROl 188 amino acid sequence and the following Dayhoff sequences: SSU83114_1, 
S56015, CET21B6_4, CELT19D2J. and TSPl_MOUSE. 

Clone DNA52598-1518 has been deposited with ATCC and is assigned ATCC deposit no 203107. 

EXAMPLE 31 : Isolation of cDNA clones Encoding Human PROl 133 

20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This sequence was extended using repeated cycles of phrap. The extended consensus 
sequence is designated herein DNA38102. Based on the DNA38102 consensus sequence, oligonucleotides were 
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as 
probes to isolate a clone of the full-length coding sequence for PROl 133. 

25 PCR primers (two forward and one reverse) were synthesized: 

forward PCR primer 1 5'-TCGATTATGGACGAACATGGC AGC-3 ' (SEQ ID NO:130); 
forward PCR primer 2 5'-TTCTGAGATCCCTCATCCTC-3' (SEQ ID NO: 131); and 
reverse primer 5 ' - AGGTTC AGGG AC AGC AAGTTTGGG-3 * (SEQ ID NO: 132). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 

30 DNA38102 sequence which had the following nucleotide sequence: 
hybridization probe 

5 , Tr^XK:TGGACCTCGGCTACGGAATTGGCTTCCCTCTACGGACAGCTGGAT3 , (SEQ ID NO: 133). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
35 isolate clones encoding the PROl 133 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01133 and the derived protein sequence for PROU33. 

The entire coding sequence of PROl 133 is shown in Figure 73 (SEQ ID NO: 128). Clone DNA53913- 
1490 contains a single open reading frame with an apparent translation^ initiation site at nucleotide positions 
266-268 and an apparent stop codon at nucleotide positions 1580-1582 of SEQ ID NO: 128. The predicted 
5 polypeptide precursor is 438 amino acids long . The signal peptide is at amino acids 1 - 1 8 of SEQ ID NO: 129. 
EGF-likc domain cysteine pattern signatures start at 315 and 385 of SEQ ID NO: 129 as shown in Figure 74. 
Clone DNA53913-1490 has been deposited with ATCC and is assigned ATCC deposit no. 203 162. The full- 
length PROl 133 protein shown in Figure 74 has an estimated molecular weight of about 49.260 daltons and a 
pi of about 6.15. 

10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 74 (SEQ ID NO: 129), revealed some sequence 
identity between the PROl 133 amino acid sequence and the following Dayhoff sequences (data from the database 
incorporated herein): AF002717J, LMG1HUMAN, B54665, UNC6_CAEEL, LML1CAEEL, 
LMA5_MOUSE, MMU88353J, LMA1_HUMAN, HSLN2C64J and AF0O5258J. 

15 

EXAMPLE 32: Isolation of cDNA clones Encoding Human PRQ784 

An initial DNA sequence (SEQ ID NO: 136), referred to herein as DNA44661 and shown in Figure 77, 
was identified using a yeast screen, in a human fetal lung cDNA library that preferentially represents the 5' ends 
of the primary cDNA clones. DNA44661 was then compared to ESTs from public databases (e.g., GenBank), 

20 and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer 
program BLAST or BLAST2 [Altschul ct al. , Methods in Enzvmologv. 266:460-480 (1996)]. The ESTs were 
then clustered and assembled into a consensus DNA sequence using the computer program M phrap w (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained is designated herein as 
"DNA45463". Based on the DNA45463 consensus sequence, oligonucleotides were synthesized for use as 

25 probes to isolate a clone of the full-length coding sequence for PR0784 from a human fetal lung cDNA library. 

The full length DNA53978-1443 clone shown in Figure 75 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 37-39 and ending at the stop codon found at 
nucleotide positions 821-823 (Figure 75; SEQ ID NO: 134). The predicted polypeptide precursor (Figure 76, 
SEQ ID NO: 135) is 228 amino acids long. PR0784 has a calculated molecular weight of approximately 25,735 

30 Daltons and an estimated pi of approximately 5.45 . PR0784 has the following features: a signal peptide at about 
amino acid 1 to about 15; transmembrane domains at about amino acids 68 to about 87 and at about 183 to about 
204; potential N-myristoylation sites at about amino acids 15-20, 5 1 -56, 66-60, 1 63- 1 68 , and 206-2 1 1 ; and an 
RNP-1 protein RN A -binding region at about amino acids 108 to about 117. 

Clone DNA53978-1443 was deposited with ATCC on June 16, 1998, and is assigned ATCC deposit 

35 no. 209983. 

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of 
the full-length sequence, PR0784 shows amino acid sequence identity to the following proteins: RNU42209J , 
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MMU91538J, CGU91742J, CELF55A4J, SC22_YEAST, and F48188. 

EXAMPLE 33 : Isolation of cDNA Clones Encoding Human PRQ783 

A yeast screening assay was employed te identify cDNA clones thai encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDN A clone, designated herein as DNA4S201 
5 (Figure 80; SEQ ID NO: 139). 

The DNA45201 sequence was then used to search expressed sequence tag (EST) databases for the 
presence of potential homologies. The EST databases included public EST databases (e.g., Geofiank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was 
performed using the computer program BLAST or BLAST2 (Altshul et al . , Methods in Enzvmologv 266:460- 

10 480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained is herein designated 
as "consenOl". A proprietary Genentech EST sequence was used in the consensus assembly and is herein 
designated as DNA 14575 (Figure 81; SEQ ID NO: 140). 

15 Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 

library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0783. In order to screen several libraries for a full-length clone, DNA from the libraries was 
screened by PCR amplification, as per Ausubel et al.. Current Protocols in Molecular Biology, with the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 

20 oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -G ACTGT ATCTG AGCCCC AG ACTGC-3 ' (SEQ ID NO: 141), 
forward PCR primer 5 ' -TC AGC AATGAGGTGCTGCTC-3 ' (SEQ ID NO: 142), and 
reverse PCR primer S'-TGAGGAAGATGAGGGACAGGTTGG-S ' (SEQ ID NO: 143). 

25 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consenOl 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 ' -T ATGG A AGC ACCTG ACT ACG A AGTGCT ATCCGTGCG AG AAC AGCT ATTCC-3 ' (SEQ ID NO: 144). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

30 screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0783 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 

33 coTitaining a Nod site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the SfU site; see. Holmes et al.. Science . 
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253 : 1278-1280 (1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0783 [herein designated as DNA53996-1442] (SEQ ID NO:137) and the derived protein sequence for 
PR0783. 

The entire nucleotide sequence of DNA53996-1442 is shown in Figure 78 (SEQ ID NO: 137). Clone 

5 DNA53996-1442 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 310-312 and ending at the stop codon at nucleotide positions 1777-1779 (Figure 78). The predicted 
polypeptide precursor is 489 amino acids long (Figure 79). The full-length PR0783 protein shown in Figure 
79 has an estimated molecular weight of about 55,219 daitons and a pi of about 8.47. Analysis of the full-length 
PR0783 sequence shown in Figure 79 (SEQ ID NO: 138) evidences the presence of the following features: 

10 transmembrane domains located at about amino acids 23-42, 67-89, 111-135, 154-176, 194-218, 296-319, 
348-370, 387-410 and 427-452; leucine zipper patterns located at about amino acids 263-283 and 399-420; a 
potential tyrosine kinase phosphorylation site at about amino acids 180-187; potential N-glycosylation sites at 
about amino acids 105 -108 and 121-124; potential cAMP- and a cGMP-dependent protein kinase 
phosphorylation site at about amino acids 288-291; and a region having sequence identity with bacterial 

15 rhodopsins retinal binding site protein at about amino acids 190-218. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35) shows some sequence identity 
between the PR0783 amino acid sequence and the following Dayhoff sequences: YNC2CAEEL, D64048, 
ATAC002332_3F4P9.3, NY2RSHEEP, and VSH_MUMPA. 

Clone DNA53996- 1442 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit 

20 no. 209921. 

EXAMPLE 34 : Isolation of cDNA Clones EncodmR Human PRQ82Q 

An expressed sequence tag (EST) DNA database (Merck/Wash. U) was searched and an EST designated 
EST no. AA504080, Merck clone 825 136, was identified (library 3 12, human B-cell tonsil) . Homology searches 
25 revealed that this EST showed sequence identity with low affinity immunoglobulin gamma Fc receptor II. DNA 
sequencing gave the full-length DNA sequence for PRO820 and the derived protein sequence for PRO820. 

The entire nucleotide sequence of DNA56041-1416 is shown in Figure 82 (SEQ ID NO:145). Clone 
DNA5 6041-1416 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 115-117 and ending at the stop codon at nucleotide positions 487-489 (Figure 82). The predicted 
30 polypeptide precursor is 124 amino acids long (Figure 83). The full-length PRO820 protein shown in Figure 
83 has an estimated molecular weight of about 14,080 daitons and a pi of about 7.48. Clone DNA5 6041-1416 
has been deposited with ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 146, the putative signal peptide is at about amino 
35 acids 1-15 of SEQ ID NO: 146. Protein kinase C phosphorylation sites are at about amino acids 20-22 and 43-45 
of SEQ ID NO: 146. An N-myrtstoylation site is at about amino acids 89-94 of SEQ ID NO: 146. An 
immunoglobulin and major histocompatibility complex domain is at about amino acids 83-90 of SEQ ID NO: 146. 
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The corresponding nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 35 : Isolation of cDN A Clones Encoding Human PRO108Q 

A consensus DNA sequence was assembled relative to other EST sequences using phrap and was - 
extended using repeated cycles of BLAST and phrap so as to extend the consensus sequence as far as possible 
5 using the sources of the EST sequences as described in Example 1 above. The consensus sequence is designated 
herein as DNA52640. An EST proprietary to Genentech was employed in the consensus assembly and is herein 
designated as DNA36527 (Figure 86; SEQ ID NO: 149). 

In light of an observed sequence homology between the DNA36527 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 526423, the Merck EST clone 526423 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 84 and is herein designated as DNA56047-1456. 

The entire nucleotide sequence of DNA56047-1456 is shown in Figure 84 (SEQ ID NO: 147). Clone 
DNA56047-1456 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 159-161 and ending at the stop codon at nucleotide positions 1233-1235 of SEQ ID NO: 147 (Figure 
15 84). The predicted polypeptide precursor is 358 aminoacids long (Figure85). The full-length PRO 1080 protein 
shown in Figure 85 has an estimated molecular weight of about 40,514 daltons and a pi of about 6.08. Clone 
DNA56047-1456 has been deposited with ATCC on June 9, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

20 Also shown in Figure 85 are the approximate locations of the signal peptide, cell attachment site, Nt- 

DnaJ domain signature, region having sequence identity with Nt-DnaJ domain proteins, and N-glycosylation 
sites. The corresponding nucleic acids of these amino acid sequences and others provided herein can be 
routinely determined by the information provided herein. 

25 EXAMPLE 36: Isolation of cDNA Clones Enco ding Human PRO1079 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, and is herein designated DNA52714. Based on information provided by the assembly, the 
clone for Merck EST no. H06898 was obtained and sequenced, thereby giving the nucleotide sequence 
designated herein as DNA56050-1455. The entire nucleotide sequence of DNA56050-1455 is shown in Figure 

30 87 (SEQ ID NO: 150). Clone DNA56050-1455 contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 183-185 and ending at the stop codon at nucleotide positions 
861-863 (Figure 87). The predicted polypeptide precursor is 226 amino acids long (Figure 88). The full-length 
PRO1079 protein shown in Figure 88 has an estimated molecular weight of about 24,61 1 Daltons and a pi of 
about 4.85. Analysis of the full-length PRO1079 sequence shown in Figure 88 (SEQ ID NO:3) evidences the 

35 presence of the following features: a signal peptide at about amino acid 1-29; potential N-myristoylation sites 
at about amino acids 10-15, and 51-56; homology to photosystem I psaG and psaK proteins at about amino acids 
2 to 20; and homology to prolyl endopeptidase family serine proteins at about amino acids 150 to 163. 
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Analysis of the amino acid sequence of the full-length PRO 1079 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced some sequence identity between the PRO 1079 amino acid sequence and 
the following Dayhoff sequences: CEK10C3_4, MMU50734J, D69503, AF051149J, and VSMP_CVMS. 

Clone UNQ536 (DNA56050-1455) was deposited with the ATCC on June 22, 1998. and is assigned 
ATCC deposit no. 20301 1. 

5 

EXAMPLE 37 : Isolation of cDNA clones Encoding Human PRQ793 

A cDNA clone (DNA561 10-1437) encoding a native human PR0793 polypeptide was identified by a 
yeast screen, in a human skin tumor cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA50177 (Figure 

10 91 ; SEQ ID NO: 154). The DNA50177 sequence was then compared to various EST databases including public 
EST databases (e.g., GenBank), and a proprietary EST database (LIFESEQ 0 , Incyte Pharmaceuticals, Palo Alto, 
CA) to identify homologous EST sequences. The comparison was performed using the computer program 
BLAST or BLAST2 [Altschul et al., Methods in Enzvmolotrv. 266:460480 (1996)]. Those comparisons 
resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were 

15 clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). This consensus sequence is herein designated DNA50972. 

In light of an observed sequence homology between the DNA50972 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. N33874, the Merck EST clone N33874 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

20 The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA561 10-1437. 

The full-length DNA561 10-1437 clone shown in Figure 89 contains a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide 
positions 491-493 (Figure 89). The predicted polypeptide precursor is 138 amino acids long (Figure 90). The 
full-length PR0793 protein shown in Figure 90 has an estimated molecular weight of about 15,426 daltons and 

25 a pi of about 10.67. Analysis of the M-length PR0793 sequence shown in Figure 90 (SEQ ID NO: 153) 
evidences the presence of the following: transmembrane domains from about amino acid 12 to about amino acid 
30, from about amino acid 33 to about amino acid 52, from about amino acid 69 to about amino acid 89 and 
from about amino acid 93 to about amino acid 109, potential N-myristolation sites from about amino acid 1 1 to 
about amino acid 16, from about amino acid 51 to about amino acid 56 and from about amino acid 1 16 to about 

30 amino acid 121 and an amino acid sequence block having homology to an aminoacyl-transfer RNA synthetase 
class-II protein from about amino acid 49 to about amino acid 59. Clone DNA561 10-1437 has been deposited 
with ATCC on August 11, 1998 and is assigned ATCC deposit no. 203113. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 153), evidenced certain 

35 homology between the PR0793 amino acid sequence and the following Dayhoff sequences: S47453, 
AF015193J2, MTEHGNS92, E64030. H69784, D64995. CD53_MOUSE, GEN8006, AE001I38_7 and 
COX2_STRPU. 
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EXAMPLE 38 : Isolation of cDNA Clones Encoding Human PRO1016 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus sequence obtained is herein designated DNA53502. 

In light of an observed sequence homology between the DNA53502 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 38680 . the Merck EST clone 38680 was purchased and 
5 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 92. 

The entire nucleotide sequence of DNA561 13-1378 is shown in Figure 92 (SEQ ID NO: 155). Clone 
DNA561 13-1378 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 168-170 and ending at the stop codon at nucleotide positions 1302-1304 (Figure 92). The predicted 
10 polypeptide precursor is 378 amino acids long (Figure 93). The full-length PRO1016 protein shown in Figure 
93 has an estimated molecular weight of about 44,021 daltons and a pi of about 9.07. Clone DNA561 13-1378 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 10 16 polypeptide suggests that portions of 
15 it possess sequence identity with acyltransferase, thereby indicating that PRO 10 16 may be a novel 
acyltransferase. 

Still analyzing the amino acid sequence of SEQ ID NO: 156, the putative signal peptide is at about amino 
acids 1-18 of SEQ ID NO:156. The transmembrane domain(s) are at about amino acids 332-352 and 305-330 
of SEQ ID NO: 156. The fructose-bisphosphate aldolase class-II protein homology sequence is at about amino 
20 acids 73-90 of SEQ ID NO: 156. The extradiol ring-cleavage dioxygenase protein is at about amino acids 252- 
275 of SEQ ID NO: 156. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

The specific Dayhoff database designation names of sequences to which PRO 101 6 has sequence identity 
with include the following: S52645, PR59712, P_R99249, P_R59713, BNAGPATRFJ, CELT05H4J5 and 
25 CELZK40J. 

EXAMPLE jfrlwlation, of cPNA Encoding Human PRQ1Q13 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus DNA sequence was then extended using repeated cycles of BLAST and 

30 phrap to extend the consensus sequence as far as possible using the sources of EST sequences. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 3107695, the Incyte EST clone 3107695 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 94 and is herein designated as DNA56410-1414. 

35 The entire nucleotide sequence of DNA56410-1414 is shown in Figure 94 (SEQ ID NO:157). Clone 

DNA56410-1414 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 17-19 and ending at the stop codon at nucleotide positions 1244-1246 (Figure 94). The predicted 
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polypeptide precursor is 409 amino acids long (Figure 95). The full-length PRO 10 13 protein shown in Figure 
95 has an estimated molecular weight of about 46,662 daltons and a pi of about 7. 18. Clone DNA56410-1414 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 1 58, the putative signal peptide is at about amino 
5 acids 1-19 of SEQ ID NO: 158. N-glycosylation sites are at about amino acids 75-78 and 322-325 of SEQ ID 
NO: 158. An N-myristoylation site is at about amino acids 184- 189 of SEQ ID NO: 158. A growth factor and 
cytokine receptor family domain is at about amino acids 134-149 of SEQ ID NO: 158. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

Blast analysis showed some sequence identity with other proteins. Specifically, PRO 10 13 has some 
10 sequence identity with at least the Dayhoff sequences designated: D63877J; MHU22019_1, AE000730J0, and 
AF019079J. 

EXAMPLE 40 : Isolation of cDNA Clones Encoding Human PRQ937 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. That consensus sequence is herein designated DNA49651. Based on the DNA49651 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0937. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5 , -CTCCGTGGTAAACCCCACAGCCC-3' (SEQ ID NO: 161); and 
reverse PCR primer 5 ' -TC AC ATCG ATGGG ATCC ATG ACCG-3 * (SEQ ID NO: 162). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48651 sequence 
which had the following nucleotide sequence: 

25 5'-GGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAG-3 f (SEQ ID NO: 163). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0937 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR0937 [herein designated as DNA56436- 14481 (SEQ ID NO: 159) and the derived protein sequence for 
PR0937. 

The entire nucleotide sequence of DNA56436-1448 is shown in Figure 96 (SEQ ID NO: 159). It 
contains a single open reading frame having an apparent translational initiation site at nucleotide positions 499- 
35 501 and ending at the stop codon found at nucleotide positions 2167-2169 (Figure 96, SEQ ID NO:159). The 
predicted polypeptide precursor is 556 amino acids long, has a calculated molecular weight of approximately 
62,412 daltons and an estimated pi of approximately 6.62. Analysis of the full-length PR0937 sequence shown 
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in Figure 97 (SEQ ID NO: 160) evidences the presence of the following features: signal peptide at about amino 
acids 1-22; ATP/GTP-binding site motif A (P-loop) at about amino acids 515-523; a potential N-gJycosylation 
site at about amino acids 514-517; and sites of glypican homology at about amino acids 54-74, 106-156, 238- 
279. 309-345,423^59. and 468-505. 

Clone DNA56436-1448 has been deposited with ATCC on May 27, 1998, and is assigned ATCC 
5 deposit no. 209902. 

Analysis of the amino acid sequence of the full-length PR0937 polypeptide suggests that it possesses 
significant sequence similarity to glypican proteins, thereby indicating that PR0937 may be a novel glypican 
protein. More specifically, an analysis of the Dayhoff database (version 35,45 SwissProt 35) evidenced 
significant homology between the PR0937 amino acid sequence and the following Dayhoff sequences: 
10 GPCK_MOUSE, GPC2__RAT, GPC5_HUMAN, GPC3_HUMAN, P_R30168. CEC03H12J2, GEN13820, 
HS119E23J, HDAC_DROME, and AF017637J. 

EXAMPLE 41 : Isolation of cDNA clones Encoding Human PRQ842 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence designated herein as Incyte EST cluster sequence no. 69572. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et ah, Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 

20 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap* (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 

25 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a single open reading frame with an apparent 
transitional initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
positions 510-512 (Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 

30 NO: 165) is 1 19 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819 Daltons 
and an estimated pi of approximately 11.16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoylation sites at about amino acids 27-32 and about amino acids 46-51. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
between the PROS42 amino acid sequence and the following Dayhoff sequences: CE2K13111, P R80843, 
RAT5HT2XJ, S81882J, A60912, MCU60315J37MC137L, U93422J, p_P9I996, U93462J, and 
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ZN18_HUMAN. 

Clone DNA56855-1447 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no. 203004. 

EXAMPLE 42 : Isolation of cDNA clones Encoding Human PRQ839 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte LIFESEQ® database, designated Incyte EST Cluster No. 24479. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
10 the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmologv 266:460-480 (\ 996^. Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap* (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55709. 

15 In light of an observed sequence homology between the DNA55709 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. 754525, the Merck EST clone 754525 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 100 and is herein designated as DNA56859-1445. 

The full length clone shown in Figure 100 contained a single open reading frame with an apparent 

20 translational initiation site at nucleotide positions 2-4 and ending at the stop codon found at nucleotide positions 
263-265 (Figure 100; SEQ ID NO: 166). The predicted polypeptide precursor (Figure 101, SEQ ID NO: 167) 
is 87 amino acids long. PR0839 has a calculated molecular weight of approximately 9,719 Daitons and an 
estimated pi of approximately 4.67. Other features of PR0839 include a signal peptide at about amino acids 1- 
23, potential protein kinase C phosphorylation sites at about amino acids 37-39 and about amino acids 85-87, 

25 a potential casein kinase II phosphorylation site at about amino acids 37-40, sequence identity with ribonucleotide 
reductase large subunit protein at about amino acids 50-60, and sequence identity with eukaryotic RNA-binding 
region RNP-1 proteins at about amino acids 70-79. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 101 (SEQ ID NO: 167), evidenced some 

30 homology between the PR0839 amino acid sequence and the following Dayhoff sequences: CD14MOUSE, 
XPR6_YARU, HS714385J, S49783, BB19RABIT, GVPH-HALME, AB003135J, P_R85453, 
LUU27081_2, and TP2B_MOUSE. 

Clone DNA56859-1445 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no.209019. 

35 
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B X AMf HrB . tt : Isolatioii of cPNA Clones Encoding Human PROl 180 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte EST cluster sequence no. 14732). The Incyte EST cluster sequence no. 
14732 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo 
5 Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al. , Methods in Enzvraolopv 266 :460480 ( 1996)). Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA5571 1 . 

10 In light of an observed sequence homology between the DNA5571 1 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. T60981 , the Merck EST clone T60981 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 102 and is herein designated DNA5686O-1510. 

The full length clone shown in Figure 102 contained a single open reading frame with an apparent 

1 5 translation^ initiation site at nucleotide positions 78-80 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 102; SEQ ID NO: 168). The predicted polypeptide precursor is 277 amino acids long, has a 
calculated molecular weight of approximately 31,416 daltons and an estimated pi of approximately 8.88. 
Analysis of the full-length PROl 180 sequence shown in Figure 103 (SEQ ID NO: 169) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a leucine zipper pattern 

20 sequence from about amino acid 10 to about amino acid 3 1 , and potential N -myristolation sited from about amino 
acid 64 to about amino acid 69, from about amino acid 78 to about amino acid 83, from about amino acid 80 
to about amino acid 85, from about amino acid 91 to about amino acid 96 and from about amino acid 201 to 
about amino acid 206. Clone DNA56860-1510 has been deposited with the ATCC on June 9, 1998 and is 
assigned ATCC deposit no. 209952. 

25 Analysis of the amino acid sequence of the full-length PROl 1 80 polypeptide suggests that it possesses 

sequence similarity to the methyltransferase family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PROl 180 amino acid 
sequence and the following Dayhoff sequences, MTCI65J4, D69267, YH09_ YEAST, BIOC_SERMA, 
ATAC00448415T1D16.16. SHGCPIRJ8. SPBC3B9_4, AB009504J4, P W17977 and A69952. 

30 

EXAMPLE 44: Matjc-fl of cDNA clones finco<lffl8 Hwqan PROl 134 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 75 1 1 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
35 a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. f Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55725. Two proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figure 106 (SEQ ID NO: 172) and Figure 107 JSEQ 
ID NO: 173). 

5 In light of an observed sequence homology between the DNA55725 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H94897, the Merck EST clone H94897 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 106 and is herein designated as DNA56865-1491 . 

Clone DNA56865-1491 contains a single open reading frame with an apparent translational initiation 

10 site at nucleotide positions 153-155 and ending at the stop codon at nucleotide positions 1 266-1 268 (Figure 104). 
The predicted polypeptide precursor is 371 amino acids long (Figure 105). The full-length PRO 1 134 protein 
shown in Figure 105 has an estimated molecular weight of about 4 1 ,935 daltons and a pi of about 9.58. Analysis 
of the full-length PRO 11 34 sequence shown in Figure 105 (SEQ ID NO: 171) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 23, potential N-glycosylation sites from 

15 about amino acid 103 to about amino acid 106, from about amino acid 249 to about amino acid 252 and from 
about amino acid 257 to about amino acid 260, and an amino acid block having homology to tyrosinase CuA- 
binding region proteins from about amino acid 280 to about amino acid 306. Clone DNA56865-1491 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203022. 

An analysis of the Dayhoff database (version 35.45 Swiss Prot 35), using a WU-BLAST-2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 105 (SEQ ID NO: 171), evidenced significant 
homology between the PRO 11 34 amino acid sequence and the following Dayhoff sequences: F20P5_18. 
AC002396J0, S47847, C64146, GSPA_BACSU. P_W10564, RFAI_ECOLI, Y258_HAEIN, RFAJSALTY 
and P_R32985. 

25 gCAMg^.45: Isolation of cDNA clones Encoding Human PRO830 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incytedatabase, designated 20251. This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and 
a proprietary EST DNA database (UFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

30 homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55733. 

35 In light of an observed sequence homology between the DNA55733 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H78534, the Merck EST clone H78534 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 108 and is herein designated as DNA56866-1342. 

Clone DNA56866-1342 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 154-156 and ending at the stop codon at nucleotide positions 415-417 (Figure 108). 
The predicted polypeptide precursor is 87 amino acids long (Figure 109). The full-length PRO830 protein shown 
in Figure 109 has an estimated molecular weight of about 9,272 da] tons and a pi of about 9. 19. Analysis of the 
5 full-length PRO830 sequence shown in Figure 109 (SEQ ID NO: 175) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 33, potential N-myristoylation sites from about 
amino acid 2 to about amino acid 7 and from about amino acid 8 to about amino acid 13 and a thioredoxin family 
of proteins homology block from about amino acid 23 to about amino acid 39. Clone UNQ470 (DNA56866- 
1342) has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203023. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 109 (SEQ ID NO: 175), evidenced significant 
homology between the PRO830 amino acid sequence and the following Dayhoff sequences: HSU88154J, 
HSU88153J, SAPKSGENE I, HPU31791J, GGCNOT21, CPU91421J, CHKESTPC09J, PQ0769, 
U97553J79 and B 60095. 

15 

EXAMPLE 46: Isolation of cDNA clones Encoding Human PROl 1 15 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte EST cluster sequence no. 165008. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 

20 public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 266:460-480 ( 1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 

25 University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55726. 

In light of an observed sequence homology between the DNA55726 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R75784, the Merck EST clone R75784 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure lit and is herein designated as DNA56868-1478. 

The full length clone shown in Figure 1 10 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 189-191 and ending at the stop codon found at nucleotide 
positions 1524-1526 (Figure 110; SEQ ID NO: 176). The predicted polypeptide precursor (Figure 111, SEQ 
ID NO: 177) is 445 amino acids long. PROl 115 has a calculated molecular weight of approximately 50,533 

35 Daltons and an estimated pi of approximately 8.26. Additional features include a signal peptide at about amino 
acids 1-20; potential N-glycosylanon sites at about amino acids 204-207, 295-298, and 313-316; and putative 
transmembrane domains at about amino acids 35-54 , 75-97, 126-146, 185-204, 333-350, and 353-371. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 111 (SEQ ID NO: 177), evidenced some amino 
acid sequence identity between the PROU15 amino acid sequence and the following Dayhoff sequences: 
AF053947 79, S73698, CEC47A10_4, CCOMTNDS5GJ, HS4LMP2AC1, LMP2_EBV, PA24_MOUSE, 
HCU33331_7, P-W05508, and AF002273J. 
5 Clone DNA56868- 1 478 was deposited with the ATCC on June 23, 1998 and is assigned ATCC deposit 

no. 203024.. 

EXAMPLE 47: Isolation of cDNA clones Encoding Human PRQ1277 

A consensus DNA sequence was assembled relative to other ESTs using repeated cycles of BLAST and 
10 the program "phrap" as described in Example I above. One or more of the ESTs from the assembly was 
derived from diseased coronary artery tissue. The consensus sequence obtained is designated herein as 
"DNA49434". 

In light of an observed sequence homology between the DNA49434 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3042605, the Incyte EST clone 3042605 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 112 (SEQ ID NO: 178). 

Clone DNA56869-1545 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 188-190, and an apparent stop codon at nucleotide positions 2222-2224 (Figure 1 12). 
The predicted polypeptide precursor is 678 amino acids long (Figure 113). The full-length PRO 1277 protein 
20 shown in Figure 113 has an estimated molecular weight of about 73,930 daltons and a pi of about 9.48. 
Additional features include a signal peptide at about amino acids 1-26; a transmembrane domain at about amino 
acids 181-200, and potential N-glycosylation sites at about amino acids 390-393 and 520-523, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 113 (SEQ ID NO: 179), revealed significant 
25 homology between the PR01277 amino acid sequence and Dayhoff sequence no AF012252_1 . Homology was 
also found berween the PRO 1277 amino acid sequence and the following Dayhoff sequences: AF006740J, 
CA36_HUMAN. HSU1J. HUMCOL7A1XJ, CA17 HUMAN, MMZ78163J, CAMA_CHICK, 
HSU69263 1, YNX3 CAEEL, and MMRNAM3J. 

Clone DNA56869-1545 has been deposited with ATCC and is assigned ATCC deposit no. 203161. 

30 

EXAMPLE 48: Isolation of cDNA Clones Encoding Human PRO 1 1 35 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52767. Based on the DNA52767 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PROU35. 
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In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with PCR primer pairs prepared based upon the DNA52767 sequence. A positive 
library was then used to isolate clones encoding the PROl 135 gene using the probe oligonucleotide and one of 
the PCR primers. RNA for construction of the cLN A libraries was isolated from human coronary artery smooth 
muscle tissue (LIB309). The cDNA libraries used to isolate the cDNA clones were constructed by standard 

5 methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was 
primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with Not I, 
sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such 
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et ah, 
Science . 252:1278-1280 (1991)) in the unique Xhol and NotI sites. 

10 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PRO 11 35 [herein designated as DNA56870-1492] (SEQ ID NO: 180) and the derived protein sequence for 
PROl 135. 

The entire nucleotide sequence of DNA56870-1492 is shown in Figure 1 14 (SEQ ID NO: 180). Clone 
DNA56870-1492 contains a single open reading frame with an apparent translational initiation site at nucleotide 

15 positions 62-64 and ending at the stop codon at nucleotide positions 1685-1687 (Figure 114). The predicted 
polypeptide precursor is 541 amino acids long (Figure 1 15). The full-length PROl 135 protein shown in Figure 
1 15 has an estimated molecular weight of about 60,335 daltons and a pi of about 5.26. Analysis of the full- 
length PROl 135 sequence shown in Figure 115 (SEQ ID NO: 18 1) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about aino acid 2 1 , potential N-glycosylation sited from about amino 

20 acid 53 to about ammo acid 56, from about amino acid 75 to about amino acid 78, from about amino acid 252 
to about amino acid 255 and from about amino acid 41 3 to about amino acid 4 16 and an amino acid block having 
homology to glycosyl hydrolase family 35 proteins from about amino acid 399 to about amino acid 414. Clone 
DNA56870-1492 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 209925. 
Analysis of the amino acid sequence of the full-length PROl 135 polypeptide suggests that it possesses 

25 significant sequence similarity to the alpha 1 ,2-mannosidase protein, thereby indicating that PROl 135 may be 
a novel mannosidase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PROl 135 amino acid sequence and the following Dayhoff 
sequences, DMC86E4J, D86967J, SPAC23Al_4,YH04_YEAST,B544O8,SSMAN9MAN_I,CEZC410_4, 
S61631 andMSU14190J. 

30 

EXAMPLE 49 : Isolation of cDNA Clones Encoding Human PROl 1 14 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST-2 sequence alignment computer program, to have certain sequence identity to other known interferon 
receptors. This cDNA sequence is herein designated DNA48466 and is shown in Figure 1 18 (SEQ ID NO: 184). 
35 Based on the sequence identity, probes were generated from the sequence of the DNA48466 molecule and used 
to screen a human breast carconoma library (LIB135) prepared as described in paragraph 1 of Example 2 above. 
The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes 
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et al., Science . 253 : 1278-1280 (1991)), and the cDNA size cat was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5'-AGGCTTCGCTGCGACTAGACCTC-3' (SEQ ID NO: 185) 
reverse PCR primer 5 ' -CC AGGTCGGGTAAGG ATGGTTG AG-3 ' (SEQ ID NO: 186) 
hybridization probe 

5 5 -TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC-3' (SEQ ID NO: 187) 
A full length clone was identified that contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 250-252, and a stop signal at nucleotide positions 1 183- 11 85 
(Figure 116, SEQ ID NO: 182). The predicted polypeptide precursor is 31 1 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of approximately 5.04. Analysis of the 

1 0 full-length PRO 1 1 14 interferon receptor sequence shown in Figure 1 1 7 (SEQ ID NO: 183) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 29, a transmembrane domain 
from about amino acid 230 to about amino acid 255, potential N-glycosylation sites from about amino acid 40 
to about amino acid 43 and from about amino acid 134 to about amino acid 137, an amino acid sequence block 
having homology to tissue factor proteins from about amino acid 92 to about amino acid 1 19 and an amino acid 

1 5 sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino acid 
262. Clone DNA57033-I403 has been deposited with ATCC on May 27, 1998 and is assigned ATCC deposit 
no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 17 (SEQ ID NO: 183), evidenced significant 
20 homology between the PRO 1 1 14 interferon receptor amino acid sequence and the following Dayhoff sequences : 
G01418. INRl_MOUSE, P_R71035, INGS HUMAN, A26595J, A26593J. 156215 and TFHUMAN. 

EXAMPLE SQ- IwlSKWn of cDNA Clones Encoding Human PRQ828 

A consensus DNA sequence was identified using the method described in Example 1 above. This 
25 consensus sequence is herein designated DNA35717. Based on the DNA35717 consensus sequence, 

oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, 

and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0828. 
PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 ' -GC AGG ACTTCTACGACTTC AAGGC-3 ' (SEQ ID NO: 190); and 
30 reverse PCR primer 5 ' ■ AGTCTGGGCC AGGTACTTGAAGGC-3 ' (SEQ ID NO: 191). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35717 

sequence which had the following nucleotide sequence: 

hrbrifeatfon probe 

5 , -CAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGT-3* (SEQ ID NO: 192) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0828 gene using the probe oligonucleotide and one of the PCR primers. RNA 
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for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0828 [herein designated as DNA57037-1444] (SEQ ID NO: 188) and the derived protein sequence for 
PR0828. 

The entire nucleotide sequence of DNA57037-1444 is shown in Figure 119 (SEQ ID NO:188). Clone 
5 DNA57037-1444 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 34-36 and ending at the stop codon at nucleotide positions 595-597 (Figure 119). The predicted 
polypeptide precursor is 187 amino acids long (Figure 120). The full-length PR0828 protein shown in Figure 
120 has an estimated molecular weight of about 20,996 daltons and a pi of about 8.62. Analysis of the full- 
length PR0828 sequence shown in Figure 120 (SEQ ID NO: 189) evidences the presence of the following: a 

10 signal peptide at about amino acids 1-21; sequences identity to glutathione peroxidases signature 2 at about 
amino acids 82-89; sequence identity to glutathione peroxidases selenocysteine proteins at about amino acids 35- 
60. 63-100, 107-134, and 138-159. Clone DNA57037-1444 has been deposited with ATCC on May 27, 1998, 
and is assigned ATCC deposit no. 209903. 

Analysis of the amino acid sequence of the full-length PR0828 polypeptide suggests that it possesses 

15 significant sequence similarity to glutathione peroxidases, thereby indicating that PR0828 may be a novel 
peroxidase enzyme. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0828 amino acid sequence and the following Dayhoff sequences: 
AF05331M. CELT09A12_2, AC004151_3, BTUE_ECOLI, CER05H103, PJ>80918, PWU889Q7J, and 
P_W22308. 

20 

EXAMPLE 51: Ration of cDNA clones Encoding HMmffl PRPIOO? 

A cDNA clone (DNA57 129-1413) encoding a native human PRO1009 polypeptide was identified by 

the use of a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially 

represents the 5' ends of the primary cDNA clones. First SEQ ID NO: 195 (Figure 123) was identified, which 
25 was extended by alignments to other EST sequences to form a consensus sequence. Oligonucleotide probes 

based upon the consensus sequence were synthesized and used to screen the cDNA library which gave rise to 

the full-length DNA57129-1413 clone. 

The full length DNA57129-1413 clone shown in Figure 121 contained a single open reading frame with 

an apparent translational initiation site at nucleotide positions 41-43 and ending at the stop codon found at 
30 nucleotide positions 1886-1888 (Figure 121; SEQ ID NO:193). The predicted polypeptide precursor (Figure 

122, SEQ ID NO: 194) is 615 amino acids long. Figure 122 also shows the approximate locations of the signal 

sequence, transmembrane domains, myristoylation sites, a glycosylation site and an AMP-binding domain. 

PRO 1009 has a calculated molecular weight of approximately 68,125 daltons and an estimated pi of 

approximately 7.82. Clone DNA57129-1413 has been deposited with ATCC and is assigned ATCC deposit no. 
3 5 209977 . It is understood that the deposited clone has the actual and correct sequence and that the representations 

herein may have minor, normal sequencing errors. 
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Based on a WU-BLAST-2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO1009 shows amino acid sequence identity to at least the following proteins which were 
designated in a Dayhoff database as follows: F69893, CEF28F8J2, BSY13917J7, BSY13917J7, D69187, 
D69649, XCRPFBJ, E64928, YDID_ECOLl, BNACSF8J and RPU75363_2. 

5 EXAMPLE 52 : Isolation of cDNA Clones Enc oding Human PRO 1007 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated as DNA40671. 

In light of an observed sequence homology between the DNA40671 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T70513, the Merck EST clone T70513 was purchased 
10 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 124. 

The entire nucleotide sequence of DNA57690-1374 is shown in Figure 124 (SEQ ID NO: 196) Clone 
DNA5769CM374 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 16-18 and ending at the stop codon at nucleotide positions 1054-1056 (Figure 124). The predicted 
15 polypeptide precursor is 346 amino acids long (Figure 125). The full-length PRO1007 protein shown in Figure 
125 has an estimated molecular weight of about 35.971 daltons and a pi of about 8. 17. Clone DNA57690-1374 
has been deposited with the ATCC on June 9, 1998. Regarding the sequence, it is understood that the deposited 
clone contains the actual sequence, and the sequences provided herein are based on known sequencing 
techniques. The representative figures herein show the representative numbering. 
20 Analysis of the arnino acid sequence of the full-length PRO 1007 polypeptide suggests that portions of 

it possess sequence identity to MAGPLAP, thereby indicating that PRO 1007 may be a novel member of the 
family to which MAGPIAP belongs. 

Still analyzing the amino acid sequence of SEQ ID NO: 197, the putative signal peptide is at about amino 
acids 1-30 of SEQ ID NO: 197. The transmembrane domain is at amino acids 325-346 of SEQ ID NO: 197. N- 
25 glycosylation sites are at about amino acids 118-121, 129-132, 163-166. 176-179, 183-186 and 227- 1 30 of SEQ 
ID NO: 197. Ly-6/u-Par domain protein homology is at about amino acids 17-36 and 209-222 of SEQ ID 
NO: 197. The corresponding nucleotides of the amino acids presented herein can be routinely determined given 
the sequences provided herein. 

30 EXAMPLE 53: Isolation of cDNA c l o nes Encoding Human PRO 1056 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 6425. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto. CA) to identify 

35 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in Enzvmologv 266:46(M80 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
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consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R88049, the Merck EST clone R88049 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

5 The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416-418 (Figure 126). The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO1056 protein shown 
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pi of about 5.18. Analysis of 

10 the full-length PRO1056 sequence shown in Figure 127 (SEQ ID NO: 199) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 18, a transmembrane domain from about 
amino acid 39 to about amino acid 58, a potential N-glycosylation site from about amino acid 86 to about amino 
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about 
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about 

15 amino acid 32 and an amino acid sequence block having homology to channel forming colicin proteins from 
about amino acid 24 to about amino acid 56. Clone DNA57693- 1424 has been deposited with ATCC on June 
23. 1998 and is assigned ATCC deposit no. 203008. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO: 199). evidenced significant 

20 homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLMHUMAN, 
A40533, ATNG_HUMAN, A55571, ATNG_SHEEP, S31524, GEN13025, RIC_MOUSE, A48678 and 
A10871J. 

EXAMPLE 54 : Isolation of cDNA clones Encoding Human PRQ826 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 47283. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

30 et al . , Methods in Enzvmologv 266:460-480 ( 1996)) . Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56000. 

In light of an observed sequence homology between the DNA56000 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. W69233, the Merck EST clone W69233 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 128 and is herein designated as DNA57694-1341. 
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Clone DNA57694-1341 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-312 (Figure 128). The 
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PR0826 protein shown 
in Figure 129 has an estimated molecular weight of about 1 1,050 daltons and a pi of about 7.47. Analysis of 
the full-length PR0826 sequence shown in Figure 129 (SEQ ID NO:201) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 22, potential N-myristoylation sites from 
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino 
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone 
DNA57694-1341 has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203017. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO: 201), evidenced significant 
homology between the PR0826 amino acid sequence and the following Dayhoff sequences: CCU12315_1, 
SCU96108_6, CELF39F10_4 and HELT_HELHO. 

EXAMPLE 55 : Isolation of cDNA clones Encoding Human PRQ819 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program u phrap* (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56015. 

In light of an observed sequence homology between the DNA56015 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 130 and is herein designated as DNA57695-1340. 

Clone DNA57695-1340 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-204 (Figure 1 30). The 

30 predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PR0819 protein shown 
in Figure 131 has an estimated molecular weight of about 5,216 daltons and a pi of about 4.67. Analysis of the 
full-length PROS 19 sequence shown in Figure 131 (SEQ ID NO: 203) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoylation site from about 
amino acid 2 to about amino acid 7 and a region having homology to immunoglobulin light chain from about 

35 amino acid 5 to about amino acid 33. Clone DNA57695-1340 has been deposited with ATCC on June 23, 1998 
and is assigned ATCC deposit no. 203006. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 131 (SEQ ID NO: 203), evidenced significant 
homology between the PR0819 amino acid sequence and the following Dayhoff sequences: HSU03899_1, 
HUMIGLITEBJ, YG28_HSVSA, AF031522J, PADl_YEAST and AF045484J. 

5 EXAMPLE 56 : Isolation of cDNA Clones Encoding Human PRO1006 

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
10 sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone 
DNA57699-1412 contains a single open reading frame with an apparent translational initiation site at nucleotide 
15 positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 (Figure 132), The predicted 
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PRO1006 protein shown in Figure 
133 has an estimated molecular weight of about 46, 189 daltons and a pi of about 9.04. Clone DNA57699-I412 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein arc based on known sequencing techniques. 
20 Analyzing the amino acid sequence of SEQ ID NO:205, the putative signal peptide is at about amino 

acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43, 53-56, 204-207 and 
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205. 

The corresponding nucleotides of these amino acid regions and others can be routinely determined given the 
sequences provided herein. 

25 

EXAMPLE 57 : Isolation of cDN A Clones Encoding Human PRO 1112 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific 
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag 
(EST) databases which included public EST databases (e.g.. GenBank) and a proprietary EST DNA database 

30 (LIFESEQ 0 , Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The homology search was 
performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460- 
480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is 

35 herein designated DNA56018. 

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA223546, the Merck EST clone AA223546 was 
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purchased and the cDNA insert was obtained and sequenced. It was found that this insen encoded a fuil-length 
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476. 

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206). Clone 
DNA57702- 1476 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 20-22 and ending at the stop codon at nucleotide positions 806-808 of SEQ ID NO: 206 (Figure 134). 
5 The predicted polypeptide precursor is 262 amino acids long (Figure 135). The full-length PRO 1 112 protein 
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pi of about 8.93. Figure 
135 also shows the approximate locations of the signal peptide and transmembrane domains. Clone DNA57702- 
1476 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone has the actual 
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques. 

10 Analysis of the amino acid sequence of the full-length PROl 1 12 polypeptide suggests that it possesses 

some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version 
35.45 SwissProt 35) evidenced some sequence identity between the PROl 1 12 amino acid sequence and at least 
the following Dayhoff sequences, MTY20B1M3 (a mycobacterium tuberculosis peptide), F64471, 
AE000690_6, XLU16364 1, E43259 (H + -transporting ATP synthase) and PIGSLADRXEJ (MHC class II 

15 histocompatibility antigen). 

EXAMPLE 58 : Isolation of cDNA clones Encoding Human PRO1074 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte cluster sequence No. 42586). This cluster sequence was then compared to 

20 a variety of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al„ Methods in Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

25 DNA sequence with the program "phrap" (Phil Green, Univ. of Washington. Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56251. 

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA081912 was 
purchased and the cDNA insen was obtained and sequenced. It was found thai this insert encoded a full-length 

30 protein. The sequence of this cDNA insert is shown in Figure 136 and is the full-length DNA sequence for 
PRO1074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC 
deposit no. 209953. 

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone 
DNA57704-1452 contains a single open reading frame with an apparent translational initiation site at nucleotide 
35 positions 322-324 and ending at the stop codon at nucleotide positions 1315-1317 (Figure 136). The predicted 
polypeptide precursor is 331 amino acids long (Figure 137). The full-length PRO 1074 protein shown in Figure 
137 has an estimated molecular weight of about 39,512 Daltons and a pi of about 8.03. Analysis of the full- 
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length PRO1074 sequence shown in Figure 137 (SEQ ID NO:209) evidences ihe presence of the following 
features: a transmembrane domain at about amino acids 20 to 39; potential N-glycosylation sites at about amino 
acids 72 to 75, 154 to 157, 198 to 201, 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about 
amino acids 239 to 242. and a Ly-6/u-PAR domain at about amino acids 23 to 36. 

Analysis of the amino acid sequence of the full-length PRO 1074 polypeptide suggests that it possesses 
significant sequence similarity to beta 1 , 3-galactosy ltransferase , thereby indicating that PRO 1 074 may be a novel 
member of the galactosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length 
PRO 1074 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between 
the PRO1074 amino acid sequence and the following Dayhoff sequences: AF029792_1, P_R57433, 
DMU41449J, AC000348J4, P_R47479, CET09F5_2, CEF14B6_4, CET15D65, CEC54C8_4, and 
CEE03H4J0. 

Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209953. 

EXAMPLE 59 : Isolation of cD NA clones Encoding Human PRO 1005 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, Incyte cluster sequence no. 49243. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, C A) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., M? thods ip Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did noi encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56380, 

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA256657, the Merck EST clone AA256657 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full -length 
protein. The sequence of this cDNA insert is shown in Figure 138 and is herein designated as DNA57708-141 1 . 

The full length clone shown in Figure 138 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 30-32 and ending at the stop codon found at nucleotide positions 
585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID NO:211) 
is 185 amino acids long. PRO1005 has a calculated molecular weight of approximately 20,331 daltons and an 
estimated pi of approximately 5.85. Clone DNA57708- 14 II was deposited with the ATCC June 23, 1998, and 
is assigned ATCC deposit no. 203021. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 139 (SEQ ID NO:211), evidenced some 
homology between the PRO1005 amino acid sequence and the following Dayhoff sequences: DDU07187 1, 
DDU87912J. CELD 1007 J4, A42239, DDU42597J, CYAG_DICDI, S50452, MRKCKLEPN, P-R41998, 
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and XYNARUMFL. 



RX AMPLE 60 : Isolation of cD NA clones Encoding Human PRO1073 

An initial DNA^sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID NO:214) 
was identified using a yeast screen, in a human SK-Lu-l adenocarcinoma ceii line cDNA library that 
preferentially represents the 5* ends of the primary cDNA clones. DNA55938 was then compared to ESTs from 
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo 
Alto, C A), using the computer program BLAST or BLAST2 [ Altschul et al. . Methods in EnzymoloRY, 266:460- 
480 (1996)1- The ESTs were clustered and assembled into a consensus DNA sequence using the computer 
program "phrap" C™ 1 Grccn ' University of Washington, Seattle, Washington). The consensus sequence 
obtained is designated herein as DNA5641 1. 

In light of an observed sequence homology between the DNA5641 1 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H86027 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 140. 

The full length DNA577 10-1451 clone shown in Figure 140 contained a single open reading frame with 
an apparent translation^ initiation site at nucleotide positions 345-347 and ending at the stop codon found at 
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figure 
141 , SEQ ID NO:213) is 299 amino acids long. PRO1073 has a calculated molecular weight of approximately 
34,689 daltons and an estimated pi of approximately 11.49. The PRO1073 polypeptide has the following 
additional features: a signal peptide at about amino acids 1-31. sequence identity to bZIP transcription factor 
basic domain signature at about amino acids, a potential N-glycosylation site at about amino acids 2-5, and 
sequence identity with protamine PI proteins at about amino acids 158-183. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 141 (SEQ ID NO:213), revealed some sequence 
identity between the PRO1073 amino acid sequence and the following Dayhoff sequences: MMU37351J, 
ATAC00250510T9J22.10, S59043, ENXNUPRJ, B47328, SR55_DROME, S26650, SONHUMAN, 
VIT2_CHICK, and XLC4SRPRTJ. 

Clone DNA57710-1451 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 

no. 203048. 

FX AMPLE 61: [Ration of cDNA clone s Encoding Human PRO 11 52 

A cDNA clone (DNA5771 1-1501) encoding a native human PROH52 polypeptide was identified by 
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5* ends of the 
primary cDNA clones. Specifically, a yeast screen was employed to identify a cDNA designated herein as 
DNA55807 (SEQ ID NO:217; see Figure 145). 

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence 
encompassed within the Merck EST clone no. R56756. the Merck EST clone R56756 was purchased and the 
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cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 143. 

The full-length DNA5771 1-1501 clone shown in Figure 143 contains a single open reading frame with 
an apparent translational initiation silt at nucleotide positions 58-60 and ending at the stop codon at nucleotide 
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144). 
5 The full-length PRO 1 1 52 protein shown in Figure 1 44 has an estimated molecular weight of about 53 ,602 daltons 
and a pi of about 8.82. Analysis of the full-length PROl 152 sequence shown in Figure 144 (SEQ ID NO:216) 
evidences the presence of the following: a signal peptide from about amino acid I to about amino acid 28, 
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to 
about amino acid 187, from about amino acid 229 to about amino acid 247 , from about amino acid 264 to about 

10 amino acid 285, from about amino acid 309 to about amino acid 330 T from about amino acid 371 to about amino 
acid 390 and from about amino acid 441 to about amino acid 464. potential N-glycosylation sites from about 
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid 
sequence block having homology to a respiratory-chain NADH dehydrogenase subunit from about amino acid 
243 to about amino acid 287. Clone DNA5771 1-1501 has been deposited with ATCC on July 1, 1998 and is 

15 assigned ATCC deposit no. 203047. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 
homology between the PROl 152 amino acid sequence and the following Dayhoff sequences: AF052239 1, 
SYNN9CGAJ, SFCYTB2J, GEN12507, PJU1769, MTV025J09, C61168, S43171, P_P61689 and 

20 P_P61696. 

EXAMPLE 62 : Isolation of cDN A clones Encoding Human PROl 136 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database, designated 109142. This EST cluster sequence was then compared 

25 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Ufeseq* lncyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
el al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

30 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56039. 

In light of an observed sequence homology between the DNA56039 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. HSC1NF01 1 , the Merck EST clone HSC1NF01 1 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 

35 protein. The sequence of this cDNA insert is shown in Figure 146 and is herein designated as DNA57827- 1493 . 

Clone DNA57827-1493) contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 2 16-2 1 8 and ending at the stop codon at nucleotide positions 2 1 12-2 114 (Figure 146) . 
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The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PR01136 protein 
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pi of about 8.5. Analysis 
of the full-length PROH36 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and potential N-glycosylaiion sites 
from about amino acid 108 to about amino acid i i, from about amino acid 157 to about amino acid 160, from 
5 about amino acid 289 to about amino acid 292 and from about amino acid 384 to about amino acid 387. Clone 
DNA57827-1493 has been deposited with ATCC on July 1, 1998 and is assigned ATCC deposit no. 203045. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:219), evidenced significant 
homology between the PRO 1136 amino acid sequence and the following Dayhoff sequences: AF034746 1, 
10 AF034745J, MMAF000168J9, HSMUPP1J, AF060539J, SP97_RAT, 138757, MMU93309J, 
CEK01A6_4 and HSA224747J. 

EXAMPLE 63 : Isolation of cDNA clones Encoding Human PRQ813 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence (Incyte EST cluster sequence no. 45501. The Incyte EST cluster sequence no. 
45501 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ ™ . Incyte Pharmaceuticals, Palo 
Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 266:460-480 ( 1996)). Those comparisons resulting 

20 in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56400. 

In light of an observed sequence homology between the DNA56400 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased 

25 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 109-111 and ending at the stop codon found at nucleotide 
positions 637-639 (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long, 

30 has a calculated molecular weight of approximately 19,616 daltons and an estimated pi of approximately 7.11. 
Analysis of the full-length PR0813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of 
the following: a signal peptide from about amino acid I to about amino acid 26 and potential N-myristoylation 
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158, 
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to about amino acid 172. 

35 Clone DNA57834- 1339 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 
209954. 
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» Analysis of the amino acid sequence of the full-length PR0813 polypeptide suggests that it possesses 

sequence similarity to the pulmonary surfactant-associated protein C. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology between the PR0813 amino 
acid sequence and the following Dayhoff sequences, PSPC_MUS VI, P_P9>071, G02964, P_R65489, P_P82977, 
P_R84555, S55542, MUSIGHAJJ and PHI 158. 

5 

EXAMPLE 64 : Isolation of cDNA Clones Encoding Human PRO809 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g. , GenBanlc) and a proprietary EST DN A 

10 database (LIFESEQ™, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in Enzvmologv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56418. 

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, ihe Merck EST clone H74302 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 150 and is herein designated DNA57836-1338. 

20 The entire nucleotide sequence of DNA57836-1338 is shown in Figure 150 (SEQ ID NO:222). Clone 

DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO:222 (Figure 150). 
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRO809 protein 
shown in Figure 15 1 has an estimated molecular weight of about 29,061 daltons and a pi of about 9.18. Figure 

25 151 further shows the approximate positions of the signal peptide and N-glysosylation sites. The corresponding 
nucleotides can be determined by referencing Figure 150. Clone DNA57836-1338 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO809 polypeptide suggests that it possesses 

30 some sequence similarity to the heparin sulfate proteoglycan and to endothelial cell adhesion molecule- 1 . More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PRO809 amino acid sequence and the following Dayhoff sequences, PGBM_MOUSE, D82082 1 
and PW14158, 

35 EXAMPLE 65 : Isolation of cDNA Clones Encoding Human PRQ791 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
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sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DN A 
database (LIFESEQ™. Incytc Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 { Altshul et al. , Methods, in EnyymolQgY 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some case;; 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 152 and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-1337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PR0791 protein 
shown in Figure 153 has an estimated molecular weight of about 27,368 daltons and a pi of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracellular proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR079 1 polypeptide suggests that it has sequence 
similarity with MHC-1 antigens, thereby indicating that PR0791 may be related to MHC-1 antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some sequenc identity 
between the PR0791 amino acid sequence and the following Dayhoff sequences, AF034346_1 , MMQ1K51 and 
HFE_HUMAN. 

EXAMPLE 66 : Isolation of cD NA clones Encoding Human PRO 1004 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 73681 . This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al.. Methods in EnzvmoloEY 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap* (Phil Green, Univ. of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated as DNA56516. 

In light of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
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and the cDNA insert was obtained and sequenced. Jt was found thai this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The full length clone shown in Figure 154 contained a single open reading frame with an apparent 
translaiional initiation site at nucleotide positions 119-121 and ending at the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO:226). The predicted polypeptide precursor is 1 15 amino acids long (Figure 
155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13,649 daltons and a pi of about 9.58. Analysis of the full-length PRO1004 sequence shown 
in Figure 155 (SEQ ID NO:227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C -terminal targeting signal at about amino acids 1 13-115. a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolate reductase proteins at 
about amino acids 22-48. 

Analysis of the amino ac id sequence of the full-length PRO 1004 polypeptide using the Dayhof f database 
(version 35.45 SwissProt 35) evidenced homology between the PRO1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3J7, LECl_MOUSE, AF006691J, SSZ97390J, SSZ97395J, and 
SSZ97400J. 

Clone DNA57844-1410 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no. 203010. 

S AMPLE 67 : Isolation of cDNA clones Encoding Hum an PROl 1 1 1 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which had homology to insulin-like growth factor binding protein. 

RNA for construction of cDNA libraries was isolated from human fetal brain. The cDNA libraries used 
to isolate the cDNA clones encoding human PROl 1 1 1 were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
conuining a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science, 
253:1278-1280 (1991)) in the unique Xhol and NotI. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 
5 '^C ACCACCTGGAGGTCCTGCAGTTGGGC AGG AACTCCATCCGGCAGATTG-3 * (SEQ ID NO:251). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PROl 1 1 1 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open reading frame with an 
apparent translaiional initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 2016- 
2018 (Figure 156; SEQ ID NO:228). The predicted polypeptide precursor is 653 amino acids long (Figure 157). 
The transmembrane domains are at positions 2M0 (type II) and 528-548. Clone DNA58721-1475 has been 
deposited with ATCC and is assigned ATCC deposit no. 203110. The full-length PROllll protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pi of about 6.99. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 157 (SEQ ID NO: 229), revealed some sequence 
identity between the PROl 1 1 1 amino acid sequence and the following Dayhoff sequences: A58532, D86983_l , 
RNPLGPV 1, PGS2 HUMAN, AF038I271, ALS_MOUSE, GPVHUMAN, PGS2_BO\aN, ALS_PAPPA 
and 147020. 

5 

EXAMPLE 68 : Isolation of cDNA clones Encoding Human PRO 1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01344. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' - AGGTTCGTG ATGG AG AC AACCGCG-3 ' (SEQ ID NO:232) 
reverse PCR primer 5-TGTCAAGGACGCACTGCCGTCATG-3' (SEQ ID NO:233) 
15 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization PTobe 

5 -TGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCC-3' (SEQ ID NO:234) 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1344 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01344 (designated herein as DNA58723-1588 [Figure 158, SEQ ID NO:230]); and the derived protein 

25 sequence for PRO 1 344 . 

The entire nucleotide sequence of DNA58723-1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-1588 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PROI344 protein shown in Figure 

30 159 has an estimated molecular weight of about 80, 199 daltorts and a pi of about 7.77. Analysis of the full- 
length PRO 1 344 sequence shown in Figure 159 (SEQ ID NO:231) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signature 
sequence from about amino acid 260 to about amino acid 271 , potential N-glycosylation sites from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about amino acid 282, from about amino acid 316 

35 to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 6 17, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a CUB domain protein profile sequence from about amino 
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acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18, 1998 
and is assigned ATCC deposit no. 203133. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO: 231), evidenced significant 
homology between the PRO 1344 amino acid sequence and the following Dayhoff sequences: S77063_l, 
5 CRAR_MOUSE, P_R74775, P_P90070, PR09217, P_P70475, HSBMP16J and U50330J. 

EXAMPLE 69 : Isolation of cDNA clones Enco ding Human PROl 109 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52642. The consensus DNA sequence 

10 was obtained by extending using repeated cycles of BLAST and phrap a previously obtained consensus sequence 
as far as possible using the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PROl 109. 
PCR primers (forward and reverse) were synthesized: 

15 forward PCR primer 5 ' -CCTT ACCTCAGAGGCC AGAGCAAGC-3 ' (SEQ ID NO:237) 
reverse PCR primer 5 ' -GAGCTTCATCCGTTCTGCGTTC ACC - 3 ' (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 

HyhriHiTajinn pmhe 

20 5 * -CAGGAATGTA AAGCTTTAC AGAGGGTCGCC ATCCTCGTTC CCC ACC-3 ' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu-1 adenocarcinoma cell tissue (LIB247). 

25 DNA sequencing of die clones isolated as described above gave the full-length DNA sequence for 

PRO1109 (designated herein as DNA58737-1473 [Figure 160, SEQ ID NO:235]) and the derived protein 
sequence for PROl 109. 

The entire nucleotide sequence of DNA58737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translational initiation site at nucleotide 

30 positions 1 19-120 and ending at the stop codon at nucleotide positions 1 15 1-1 153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PROl 109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 daltons and a pi of about 9.34. Analysis of the full- 
length PROl 109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 27, potential N-glycosylation sites from about amino 

35 acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
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on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID N0.236), evidenced significant 
homology between the PROl 109 amino acid sequence and the following Dayhoff sequences: HSUDPGAL 1, 
HSUDPB14J, NALSBOVIN. HSU10473_1. CEW02312J1, YNKCAEEL, AE000738J1, CET24DM, 
5 S48121andCEGLY9_l. 

EXAMPLE 70 : Isolation of cDNA clones Encoding Human PRO 1383 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53961. Based on the DNA53961 
10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1383. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5^CATTTCCTTACCCTGGACCCAGCTCC-3' (SEQ ID NO:242) 
15 reverse PCR primer 5 ' -GAAAGGCCC AC AGCAC ATCTGGC AG-3 ' (SEQ ID NO:243) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53961 
sequence which had the following nucleotide sequence 
hybridization probe 

5 , -CCACGACCCGAGCAACTTCCTCAAGACCGACTTGTTTCTCTACAGC-3 , (SEQ ID NO:244) 
20 In order to screen several libraries for a source of a full -length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1 383 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
25 PR01383 (designated herein as DNA58743-1609 [Figure 162, SEQ ID NO; 240]) and the derived protein 
sequence for PR01383, 

The enure nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240), Clone 
DNA58743- 1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 

30 polypeptide precursor is 423 amino acids long (Figure 163). The full-length PR01383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pi of about 6.77. Analysis of the full- 
length PR01383 sequence shown in Figure 163 (SEQ ID NO:241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 

35 37 , from about amino acid 58 to about amino acid 61 , from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino acid 300 to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Clone DNA58743-1609 has been deposited with ATCC on 
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August 25, 1998 and is assigned ATCC deposii no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PRO 1383 amino acid sequence and the following Dayhoff sequences: NMB__HUMAN, 
QNRCOTJA, PW38335, PUS^CHICK, P_W38164, A45993J, MMU70209 1, D83704J andP_W39176. 

5 

EXAMPLE 71 : Isolation of cDNA Clones Encoding Human PRO 1003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 

10 (UFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmoloav 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

1 5 obtained therefrom is herein designated consenO 1 . 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

20 The entire nucleotide sequence of DNA58846-I409 is shown in Figure 164 (SEQ ID NO:245). Clone 

DNA58846-1409 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions 293-295 (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The full-length PRO 1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 daltons and a pi of about 9.28. Analysis of the full-length 

25 PRO 1003 sequence shown in Figure 165 (SEQ ID NO: 246) evidences me presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-cependeni protein kinase phosphorylation site at about amino acids 
58 to about 61 . Analysis of the amino acid sequence of the full-length PRO 1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO 1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363J. SRTX_ATREN. A48298, MHVJHMSJ, VGL2_CVMJH, 

30 DHDHTC2J, CORT_RAT, TAL6_HUMAN, P_W14I23. and DVUFI 2, 

EXAMPLE 72 : Isolation of cDNA Clones Encoding Human PRO1108 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 
35 In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The full-length PRO 1 108 protein shown in Figure 

5 167 has an estimated molecular weight of about 52,071 daltons and a pi of about 9.46. Analysis of the full- 
length PROl 108 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the following:type 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosylaion sites from about amino acid 247 to about amino acid 250, from 

10 about amino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about amino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PROl 108 polypeptide suggests that it possesses 

15 significant sequence similarity to the LPAAT protein, thereby indicating that PRO 1 108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO 1 108 amino acid sequence and the following Dayhoff sequences, 
AF015811J, CER07E3J, YL35CAEEL, S73863, CEF59F4_4, P_W06422, MMU4 1736,1, MTV008J9, 
P_R99248 and Y67 BPT7. 

20 

EXAMPLE 73: i^iarinn of cDNA Clones Encoding Human PRO 1137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 

known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 

tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
25 DNA database (UFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 

computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 2££:46O480 (1996)) as a 

comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. Using this procedure, 

Incyte EST No. 3459449, also referred to herein as U DNA7108\ was identified as an EST having a BLAST 

score of 70 or greater that did not encode a known protein. 
30 A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 

repeated cycles of BLAST and the program "phrap* (Phil Green, Univ. of Washington, Seattle, WA). The 

consensus sequence obtained therefrom is referred to herein as DNA53952. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3663102, the Incyte EST clone 3663102 was purchased 
35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 168. 
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The entire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PROl 137 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pi of about 8.65. Analysis of the full- 
5 length PRO 1137 sequence shown in Figure 169 (SEQ ID NO:250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the full-length PROl 137 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating mat PROl 137 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PROl 137 
10 polypeptide using the Dayhoff database (version 35 .45 SwissProt 35) evidenced homology between the PRO H 37 
amino acid sequence and the following Dayhoff sequences: MMART5_l, NARG_MOUSE, GEN11909, 
GEN13794, GEN14406, MMRNART62J, and P_R41876. 

EXAMPLE 74 : Isolation of cDNA clones Encoding Human PRO 11 38 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST sequence, Incyte cluster sequence no. 1652 12. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al. ( Methods in Enzvmologv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "pimp" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DN A54224 . The assembly included a proprietary 
Genentecb EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

25 In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PROl 138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

30 209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170). The predicted 
polypeptide precursor is 335 amino acids long (Figure 171). The full-length PROl 138 protein shown in Figure 
35 171 has an estimated molecular weight of about 37,421 Daltons and a pi of about 6.36. Analysis of the full- 
length PRO 11 38 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 
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acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential N- 
glycosylation sites at about amino acids 98- 10 1, 142-145, 148-151, 172-175, 176-179. 204-207, and 291-295. 

Analysis of the amino acid sequence of the full-length PRO 1 138 polypeptide suggests that it possesses 
significant sequence similarity to the CD84, thereby indicating that PROl 138 may be a novel member of the Ig 
superfamily of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PROl 138 
5 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PROl 138 
amino acid sequence and the following Dayhoff sequences: HSU82988_ 1 , HUMLY9J . P_R9763 1 , P_R97628 , 
PR97629, P_R97630, CD48_RAT, CD2HUMAN, P_P93996, and HUMBGPJ. 

Clone DNA58850-1495 was deposited with ATCC on June 9, 1998, and is assigned ATCC deposit no. 

209956. 

10 

EXAMPLE 75 : Isolation of cDNA clones Encoding Human PRO 1054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 

15 and a proprietary EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

20 consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 319751, the Incyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 

25 Clone DNA58853-1423 contains a single open reading frame with an apparent translations initiation 

site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is 180 amino acids long (Figure 174). The full-length PRO 1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daitons and a pi of about 5.0. Analysis of the 
full-length PRO 1054 sequence shown in Figure 174 (SEQ ID NO:256) evidences the presence of the following: 

30 a signal peptide from about amino acid 1 to about amino acid 1 8, a leucine zipper pattern from about amino acid 
155 to about amino acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-I423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256), evidenced significant 
homology between the PRO 1054 amino acid sequence and the following Dayhoff sequences: MUPl_MOUSE, 
MUP6_MOUSE. MUP2_MOUSE. MUP8_MOUSE. MUP5_MOUSE, MUP4_MOUSE, S10124, 
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MUPM_MOUSE, MUP_RAT and ECU70823J. 

EXAMPLE 76 ; Isolation of cDNft clones Encodine Human PR0994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
5 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et ah, j4 ct hods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

10 DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington), The 
consensus sequence obtained therefrom is herein designated DNA55728. 

In light of an observed sequence homology between the DNA55728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

15 The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA58855-1422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 31-33 and ending at the stop codon at nucleotide positions 718-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PR0994 protein shown 
in Figure 176 has an estimated molecular weight of about 25, 109 daltons and a pi of about 6.83. Analysis of 

20 the full-length PR0994 sequence shown in Figure 176 (SEQ ID NO:258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about arnino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 1 10 and from about amino acid 191 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 

25 about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 1 1 . Clone DNA58855-1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO:258), evidenced significant 

30 homology between the PR0994 amino acid sequence and the following Dayhoff sequences: AF027204J, 
TAL6_HUMAN, ILT4HUMAN, JC6205, MMU57570J, S40363, ETU56093J, S42858, P_R66849 and 
P_R74751. 

EXAMPLE 77 : Isolation of cDNA clones Encoding Human PRQ812 
35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
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and a proprietary EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmologv 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 

5 consensus sequence obtained therefrom is herein designated as DNA55721 . 

In light of an observed sequence homology between the DNA55721 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 388964, the Incyte EST clone 388964 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421. 

IQ Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PR0812 protein shown 
in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pi of about 9.3. Analysis of the 
full-length PR0812 sequence shown in Figure 178 (SEQ ID NO:260) evidences the presence of the following: 

15 a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependeni protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites 
from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about amino acid 78. Clone 
DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260). evidenced significant 
homology between the PR0812 amino acid sequence and the following Dayhoff sequences: P_W35802, 
P_W35803, PSC1_RAT, S68231, GEN13917, PSC2_RAT, CC 10_HUMAN,UTER_RABIT, AF008595J and 
A56413. 

25 EXAMPLE 78 : Isolation of cD NA clones Encoding Human PRO 1069. 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Metfrods in 

30 En^yjnjalagy. 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap 0 (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
therefrom is herein designated DNA56001. 

In light of an observed sequence homology between the DN A3 6001 consensus sequence and an EST 

35 sequence encompassed within the Incyte EST clone no. 3533881 , the Incyte EST clone 3533881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 179 and is die full-length DNA sequence for PRO1069. 
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Clone DNA5921 1-1450 was deposited with the ATCC on June 9 t 1998, and is assigned ATCC deposit no. 
209960. 

The entire nucleotide sequence of DNA59211-1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA5921 1-1450 contains a single open reading frame with an apparent trinslational initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide 
precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pi of about 8.21 . Analysis of the full-length PRO1069 
sequence shown in Figure 180 (SEQ ID NO:262) evidences the presence of the following features: a signal 
peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 41-46, 45-50, and 84-89; and homology with extracellular 
proteins SCP/Tpx-l/Ag5/PR-i/Sc7 at about amino acids 54 to about 66. 

Analysis of the amino acid sequence of the full-length PRO1069 polypeptide suggests that it possesses 
significant sequence similarity to CHIF, thereby indicating that PRO1069 may be a member of the CHIF family 
of polypeptides . More particularly, analysis of the amino acid sequence of the full-length PRO 1069 polypeptide 
using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1069 amino acid 
sequence and the following Dayhoff sequences: CHIF RAT, A55571, PLM_HUMAN, A40533. 
ATNG BOVIN, RIC_MOUSE, PETDJYNY3, VTB1_XENLA, A05009, and S75086. 

Clone DNA5921 1-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209960. 

EXAMPLE 79 : Isolation of cD NA Clones Encoding Human PRQ1129 

Use of the signal sequence algorithm described ia Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 98833. The Incyte EST cluster sequence no. 98833 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in EnzvmoloRV 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington. 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56038. 

In light of an observed sequence homology between the DNA56038 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1335241 . the Incyte EST clone 1335241 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The full length clone shown in Figure 181 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pi of approximately 7.46. 
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Analysis of the full-length PR01129 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signarure sequence 
from about amino acid 461 to about amino acid 470 and potential N-glycosylation sites from about amino acid 
1 12 to about amino acid 1 15 and from about amino acid 168 to about amino acid 171 . Clone DNA59213-1487 
5 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PROU29 polypeptide suggests that it possesses 
sequence similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PR01129 amino acid 
sequence and the following Dayhoff sequences, AC004523J, S45702, AF054821J and 153015. 

10 

EXAMPLE 80 : Isolation of cDNA clones Encoding Human PRO 1068 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 141736. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 

15 (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 

20 consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 004974, the Incyte EST clone 004974 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 
ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
414-416 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 

30 is 124 amino acids long. PRO 1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pi of approximately 8.14. The PRO1068 polypeptide has the following additional features, as 
indicated in Figure 184: a signal peptide sequence at about amino acids 1-20, a urotensin II signature sequence 
at about amino acids 1 18-123, a cell attachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependent protein kinase phosphorylation site at about amino acids 1 12-1 15. 

35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO: 266), revealed homology 
between the PRO 1068 amino acid sequence and the following Dayhoff sequences: HALBOP_l, MTV043 36, 



442 



WO 99/63088 



PCT/US99/12252 



150498, and PR78445 

Clone DNA59214-1449 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no. 203046. 

EXAMPLE 81 : Isolation of cD NA clones Enc^mg Human PRQ1066 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 79066. The incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (HFESEQ™, Incyte Pharmaceuticals, Palo Alto. CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (AJtshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121 . 

In light of an observed sequence homology between the DNA56121 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. U was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA59215-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational iniuation site at nucleotide positions 176-178 and ending at the stop codon found at nucleotide 
positions 527-529 (Figure 185; SEQ ID NO:267). The predicted polypeptide precursor is 1 17 amino acids long, 
has a calculated molecular weight of approximately 12,91 1 daltons and an estimated pi of approximately 5.46. 
Analysis of the full-length PRO1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 
N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA592 15-1425) has been 
deposited with the ATCC on June 9. 1998 and is assigned ATCC deposit no. 209961. 

Analysis of the amino acid sequence of the full-length PRO 1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1066 amino acid 
sequence and the following Dayhoff sequences, MOTI.HUMAN, AF025667J, MTCY19H9J and 
RABIGKCHJ. 

EXAMPLE 82: Isolation of c DNA Clones EncodinR Human PROl 184 

Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 
identification a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
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a proprietary EST DNA database (LIFESEQ™, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater thai did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap'* (Phil Green, University of Washington. Seattle, Washington) , The 
5 consensus sequence obtained therefrom is herein designated DNA56375, 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the lncyte EST clone no. 1428374, the lncyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 187. 

10 The full length clone shown in Figure 187 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO: 269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pi of approximately 9.64. 
Analysis of the full-length PROl 184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 

15 of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PROl 184 polypeptide suggests that it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated M DVTIMS02 1" in the 
20 Dayhoff data base, (version 35.45 SwissProt 35). Other 

Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PROH84 include: WISl_SCHPO, F002I86J, ATAC00239124 and MSArPRPJ. 

EXAMPLE 83 : Isolation of cDN A clones Encoding Human PRO 1360 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

sequence from an lncyte database, designated DNA 10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ*, lncyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

30 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology between the DNA57314 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 189 and is herein designated as DNA59488- 1603. 
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The full length clone shown in Figure 189 contained a single open reading frame with an apparent 
translation^ initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 189; SEQ ID NO:271). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO 1360 has a calculated molecular weight of approximately 31,433 daltons and an 
estimated pi of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
5 and is assigned ATCC deposit no. 203157. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 190 (SEQ ID NO: 272), revealed sequence identity 
between the PRO 1360 amino acid sequence and the following Dayhoff sequences: UN51CAEEL, 
YD4BJCHPO, AF000634J, GFO_ZYMMO, YEU_SCHPO, D86566J, ZMGFOJ, S76976, 
10 PPSA_SYNY3, and CEF28B14. 

EXAMPLE 84: Isolation of cDNA clones Encoding Human PRO1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 

15 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmologv 266:460-480 ( 1 996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

20 DNA sequence with the program **phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880, the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNAS9493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-4 1 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9,548 daltons and a pi of about 8.52. Analysis of the 

30 full-length PRO 1029 sequence shown in Figure 1 92 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rbodopsins retinal binding site protein from about amino acid 50 to about amino acid 6 1 , a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1 1 1998 

35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO: 274), evidenced significant 
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homology between the PRO 1029 amino acid sequence and the following Dayhoff sequences: S66088, 
AF031815J, MM4A6LJ, PSEIS52a-l, S17699 and P_R63635. 

EXAMPLE 85 : Isolation of cDN A clones Encoding Human PRO 11 39 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

5 cluster sequence from the Incyte database, designated 4461 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
el al. t Methods in Enzvmoloflv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or 

10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone , identified herein as a putative protein coding 

15 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PROU39 (Figure 193). 
As noted before, the deduced amino acid sequence of DN A59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRPJ. The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120), 

20 

EXAMPLE 86 : Isolation of cDNA clones Encoding Human PRO 1309 

An expressed sequence tag (EST) DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDN A clones encoding human PRO 1 309 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al. , Science, 
30 253:1278-1280 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

S , -TCC<J^GCAG<KKKJACGCCT^TCAGAAACTGCGCCGAGTTAAGGAAC-3 , (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
35 1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translation^ initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
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long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428^*50 
of SEQ ID NO:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278), revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB007876 J , GPVMOUSE, 
ALS RAT, P_R85889, LUM_CHICK, AB014462J, PGS1_CANFA, CEM88J7, A58532 and GEN 1 1209. 

EXAMPLE 87 : Isolation of cDNA Clones Encoding Human PRO1028 

10 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 

1 5 Enzvmologv 266:460-480 (1996)) . Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 

20 contained within Incyte EST clone no. 1497725 , the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 

25 positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA5 9603-1419 
has been deposited with the ATCC, Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

30 Analyzing the amino acid sequence of SEQ ID NO:281, the putative signal peptide is at about amino 

acids 1-19 of SEQ ID NO:281 . An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO:281 . A 
C-type lectin domain is at about amino acids 108-1 17 of SEQ ID NO:281 , indicating that PR0513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 

35 
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EXAMPLE 88 : Isolation of cDNA Clones Encoding Human PRO1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte PharmaceuticaJs, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 

10 In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 

contained within Incyte EST clone no. 937605, the Incyte EST clone no. 937605 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 

1 5 DNA59605- 14 1 8 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO 1027 protein shown in Figure 
200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62. Clone DNA59605-1418 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 

20 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO 1027 may be involved in tissue formation or repair. 

25 The following Dayhoff designations appear to have some sequence identity with PRO 1027: 

SFT2_YEAST; ATM3E9J; A69826; YM16_MARPO; E64896; U60193_2; MTLRAJ205J ; MCU60315J70; 
SPAS_SHIFL;andS54213. 

EXAMPLE 89: Isolation of cDNA Clones Encoding Human PRO1107 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 

35 Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471. 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA5 9606- 1471 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PROl 107 
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase I/nucleotide phyrophosphatase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PRO 1107 may have 
at least one or all of the activities of these proteins, and that PRO! 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 3545 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at Least the following Dayhoff sequences: AF005632 1, 
P_R79148, RNU78787J, AF060218_4, A57080 and HUMATXTl. 

20 

EXAMPLE 90 : Isolation of cDNA clones Encoding Human PROl 140 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

25 EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 

30 therefrom is herein designated DNA564 1 6 . 

In light of an observed sequence homology between DNA56416and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203, is the full-length DNA sequence for PROl 140. Clone DNA59607-I497 was deposited 

35 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The full-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29,405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequeitce shown in Figure 204 (SEQ ID NO:287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 1 18, 141 to 161 and 172 to 191. 
5 Analysis of the amino acid sequence of the full-length PRO 1 140 polypeptide using the Dayhoff database 

(version 35.45 SwissProt 35) evidenced homology between the PROl 140 amino acid sequence and the following 
Dayhoff sequences: AF023602J, AF000368_1, CIN3_RAT, AF003373J, GEN13279, and AF003372J. 

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209946. 

10 

EXAMPLE 91 : Isolation of cDNA clones Enc oding Human PROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g. , GenBank) and a proprietary EST DNA database (UFESEQ™, Incyte 

15 Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2 (Altshul etal., Methods in En zvmologv 266:460480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program M phrap M (Phil Green, 
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DNA56423 and an EST sequence contained within 
Incyte EST clone no. 1711247, Incyte EST clone no. 171 1247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59609- 1470, which 
is shown in Figure 205, is the full-length DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 
25 with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1463- 1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PROl 106 protein 
30 shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pi of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the 
35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 1106 polypeptide suggests that it possesses 
significant sequence similarity to the peroxisomal ca-dependent solute carrier, thereby indicating that PRO 1 106 
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may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity between the PROl 106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161J, IG002N0l_2S, GDC_BOVIN and BTIMAIZE. 

EXAMPLE 92 : Isolation of cPNA clones Encoding Human PRQ1291 

5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

10 et al., Methods in EnzvmoloflV 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56425. 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 

15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-1556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 

20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PR01291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PR01291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 

25 amino acid 115, from about amino acid 1 60 to about amino acid 163 , from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223. , Clone DNA596 10-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between the PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552 1, 
HSU90144J, AF033107J, HSB73J, HSU90142J, GGCD80J, P_W34452, MOG_MOUSE, B39371 and 
P_R71360. 

35 
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EXAMPLE 93 : Isolation of cDNA clones Encoding Human PRO 1105 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq* lncyte Pharmaceuticals. Palo Alto. CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

10 In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 

encompassed within the lncyte EST clone no. 1853047, the lncyte EST clone 1853047 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612- 1466 is shown in Figure 209 (SEQ ID NO:292). Clone 

1 5 DNA596 12-1466 contains a single open reading frame with an apparent trans! ational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO: 292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PRO 1 105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone 

20 has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO: 293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
25 of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94: Isolation of cDNA clones Encoding Human PRQ5U 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

30 cluster sequence from the lncyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, lncyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

35 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light of an observed sequence homology between the DNA56434 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1227491, the Incyte EST clone 1227491 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 21 1 and is herein designated as DNA59613-1417. 

The entire nucleotide sequence of DNA59613-1417 is shown in Figure 21 1 (SEQ ID NO: 294). Clone 
5 DNA59613-1417 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 21 1). The predicted 
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PROS 1 1 protein shown in Figure 
212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
10 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 45-48. 73-76, 107-1 10, 1 18- 
121, 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295, An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO: 295. A kringle domain begins at about amino acid 128 of SEQ 
15 ID NO: 295 and a Iy-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO: 295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PR051 1 has some sequence identity are 
as follows: SSC20F10J; SF041083; P_W26579; S44208; JC2394; PSTA_DICDI; A27020; S59310; 
20 RAGl_RABIT; and MUSBALBC1_1. 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PRO 1 104 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

25 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmologv 266:46(M80 (1996*)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

30 the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 

35 sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465. 

The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616- 1465 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 109-11 1 and ending at the stop codon at nucleotide positions 1132-1134 of SEQ ID NO: 296 (Figure 
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214). The full-length PRO1104 
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
5 sequencing techniques. 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO:297. N- 
myristoylation sites are at about amino acids 41-46, 110-115, 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 

10 EXAMPLE 96 : Isolation of cDNA clones Encoding Human PRO 1100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. ( Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 

20 sequence encompassed within the Incyte EST clone no. 2305379, the Incyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found thai this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PRO 1 100 protein 
shown in Figure 216 has an estimated molecular weight of about 36,475 daltons and a pi of about 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 

30 techniques. 

Upon analyzing SEQ ID NO: 299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-glycosylation site, an N-myristoylation site, a CUB domain and an amiloride -sensitive sodium 
channel domain are present. It is believed that PROl 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO: 299.. 

35 
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EXAMPLE 91 : Isolation of cD NA clones Encoding Human PRQ836 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag ( EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq*. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 ( Altshul et al. , Methods in 
Enzymojogy. 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained is herein designated DNA56453. 

In light of an observed sequence homology between the DNA56453 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2610075, the Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300). Clone 
DNA59620-1463 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448- 1450 of SEQ ID NO.300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PR0836 protein 
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pi of about 5.36. Analysis 
of the full-length PR0836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the 
following: a signal peptide, N-glycosylation sites, N-myristoylation sites, a domain conserved in the 
YJL126w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone 
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analysis of the amino acid sequence of the full-length PR0836 polypeptide suggests that it possesses 
some sequence similarity to SLSl, thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PR0836 amino acid sequence and at least the following Dayhoff sequences, S58132, 
SPBC3B9J, S66714, CRU40057J and IMAJTAEEL. 

EXAMPLE 98 : Isolation of cDNA clones Encoding H uman PRO 1 141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (UFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
cl al. , Methods in Enzvmologv 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the lncyte EST clone 2679995 was purchased 

5 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-I498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PROl 141 protein 

10 shown in Figure 220 has an estimated molecular weight of about 26 , 840 daltons and a pi of about 8.19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, from about amino acid 67 to about amino acid 83, from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 

15 DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L_2, 
PCRB7PRJJ, AB000506J, LEU95O08J, MRU87980J5, YIGM_ECOLI, STU65700J, GHU62778J, 

20 CYSTSYNY3 and AF009567 1 . 

EXAMPLE 99 : Isolation of cD NA clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
25 consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 11 32. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5'-TCCTGTGACCACCCCTCTAACACC-3' (SEQ ID NO:310) and 
30 reverse PCR primer : 5'-CTGGAAC ATCTGCTGCCCAGATTC-3 ' (SEQ ID NO:31 1). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5'-GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC-3' (SEQ ID NO:312). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney. 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROl 132 and the derived protein sequence for PROU32. 

The entire nucleotide sequence of PRO 11 32 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with vn apparent translational initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO;308). The 
5 predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 
the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques 
as presented herein) and is assigned ATCC deposit no. 203108. The full-length PR01132 protein shown in 
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309), revealed sequence identity 
between the PR01132 amino acid sequence and the following Dayhoff sequences: SSU76256J, P_W10694, 
MMAE000663_6, AF013988J. U66061J, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

EXAMPLE 100: Isolation of cDNA clones Encoding Human NL7 (PRO 1346) 

A single EST sequence (#1398422) was found in the LIFESEQ* database as described in Example I 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 *-C AC ACGTCC AACCTC AATGGGC AG- 3 ' (SEQ ID NO:315) 
reverse PCR primer : 5'-GACCAGCAGGGCCAAGGACAAGG-3' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe: 

5'-GTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3' 
(SEQIDNO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID NO:3 13J) and the derived protein sequence for NL7 
35 (PRO 1346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID N0:313). Clone 
DNA59776- 1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 



457 



WO 99/63088 



PCT/US99/12252 



positions 1-3 and an apparent stop codon at nucleotide positions 1384-1386. The predicted polypeptide precursor 
is 46 1 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203 128, The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human microfibril-associated 
glycoprotein (1 MFA4_HUMAN); to known T1E-2 ligands and ligand homologues, ficolin, serum lectin and 
10 TGF-1 binding protein. 

EXAMPLE 101 : Isolation of cDN A clones Encoding Human PRQU31 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

1 5 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (UFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah, Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science 253:1278-1280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and 2 reverse) were synthesized: 
forward PCR primer S'-ATGCAGGCCAAGTACAGCAGCAC-3' (SEQ ID NO:321); 
reverse PCR primer 1 5 ' -C ATGCTG ACGACTTCCTGC A AGC-3 ' (SEQ ID NO: 322); and 
reverse PCR primer 1 5'-CCACACAGTCTCTGCiTCTTGGG-3' (SEQ ID NO:323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 , -ATGCTGGATGATGATGGGGACACCACCATGAGCCTGCATT-3 t (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROH31 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:3l8). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 3.1,966 daltons and an estimated pi of approximate 'y 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ 10 NO:319 and the region having sequence identity 
5 with LDL receptors is about 50-265 of SEQ ID NO:319, PROl 131 contains potential N-linked glycosylation 
sites at amino acid positions 95-98 and 169- 172 of SEQ ID NO:3 19. Clone DNA59777- 1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 203111. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in.Figure 230 (SEQ ID NO:319), evidenced some sequence 
1 0 identity between the PRO 1 1 3 1 amino acid sequence and the following Dayhoff sequences ; ABO 10710 1, 149053 , 
149115, RNU56863J, LY4A_MOUSE, 155686, MMU56404J, 149361, AF030313J and MMU09739J. 

EXAMPLE 102: Isolation of cDNA clones Encoding Human PRQ1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO 1281 . 
PCR primers (forward and reverse) were synthesized: 

forward PCR Primers: 
20 5'-TGGAAGGCTGCCGCAACGACAATC-3 ' (SEQ ID NO:327); 

5 ' -CTG ATGTGGCCG ATGTTCTG-3 ' (SEQ ID NO:328); and 

5 * - ATGGCTC AGTGTGC AG AC AG-3 ' (SEQ ID NO:329). 

reverse PCR primers: 

5 ' -GC ATGCTGCTCCGTG AAGT AGTCC -3 ' (SEQ ID NO:330); and 
25 5 ' - ATGC ATGGG AAAG AAGGCCTGCCC -3 * (SEQ ID NO:331). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe: 

5 f -TGC ACTGGTGACC ACG AGGGGGTGC ACTATAGCCATCTGG AGCTGAG-3 ' (SEQ ID NO:332). 
30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 

to isolate clones encoding the PRO 1281 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID NO:325]; and the derived protein 

sequence for PR01281. 
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The entire coding sequence of PR01281 is shown in Figure 232 (SEQ ID NO;325). Clone DNA59820- 
1549 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amnio acids long. The full-length PR0128I protein shown in Figure 233 has an estimated molecular 
weight of about 85,481 daltons and a pi of about 6.92. Additional features include a signal peptide ai about 
amino acids 1-15; and potential N-glycosylation sites at about amino acids 138-141 and 361-364. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 233 (SEQ ID NO:326), revealed some sequence 
identity between the PR01281 amino acid sequence and the following Dayhoff sequences: S44860, CET24D1J, 
CEC38H2J, CAC2_HAECO, B3A2HUMAN, S22373, CEF38A3_2, CEC34F6J, CEC34F63. and 
CELT22B113. 

Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129, 

FX AMPLE 103 : Isolation of cDNA clones E ncoding Human PRO 1064 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ®, lncyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 [Altschul et al.. Methods in Enzvmology. 
266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is 
herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDNA library which resulted in the identification of 
the DNA59827-I426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contain the Sfil site; see, Holmes et al., Science . 252:1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5'-CTGAGACCCTGCAGCACCATCTG-3' (SEQ ID NO:336) 
reverse PCR primer 5 '-GGTGCTTCTTG AGCCCC ACTT AGC -3 ' (SEQ ID NO:337) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence 
hybridization probe 

5 ' -AATCT AGCTTCTCC AGGACTGTGGTCGCCCCGTCCGCTGT-3 ' (SEQ ID NO:338) 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID NO:333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
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molecular weight of approximately 17,317 daltons and an estimated pi of approximately 5. 17. Analysis of the 
full-length PRO 1064 sequence shown in Figure 235 (SEQ ID NO:334) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 1 10, an indole-3-gIycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeat protein homology block from about 

5 amino acid 1 14 to about amino acid 137. Clone DNA59827-1426 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 235 (SEQ ID NO:334), evidenced homology 
between the PRO 1064 amino acid sequence and the following Dayhoff sequences: MMNP15PR01, 

10 BP187PLYHJ , CELF42G8_4,MMU58888_1 .GEN14270, TUB8_SOLTU, RCN_MOUSE,HUMRBSY79_l , 
SESENODAJ and A21467J, 

EXAMPLE 104 : Isolation of cDNA clones Encoding Human PRO 1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1379. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5 ' -TGG AC ACCGT ACCCTGGT ATCTGC - 3 ' (SEQ ID NO:341) 
reverse PCR primer 5 ' -CC AACTCTG AGGAG AGCAAGTGGC-3 ' (SEQ ID NO:342) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
hybridization probe 

25 5*-TGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAAC-3' (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01379 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal kidney tissue. 
30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 

derived protein sequence for PR01379 (SEQ ID NO:340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828- 

1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
35 10- 12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 

amino acids long. The full-length PR01379 protein shown in Figure 238 has an estimated molecular weight of 

about 65,355 daltons and a pi of about 8.73. Additional features include a signal peptide at about amino acids 
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1-17 and potential N-glycosylation sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 238 (SEQ ID NO: 340), revealed some homology 
between the PRO 1379 amino acid sequence and ihe following Dayhoff sequences: YHY8YEAST, AF040625_ I , 
HP714394J, and HIV18U45630J. 
5 Clone DNA59828-1608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 

EXAMPLE 105 : Isolation of cDNA Clones Encoding Human PRQ844 

An expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed sequence identity with aLP. Based on the information 
10 and discoveries provided herein, the clone for this EST, Incyte clone no. 2657496 from a cancerous lung library 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO: 344) which includes the full-length DNA sequence for PR0844 and the derived protein sequence 
for PR0844. 

1 5 The entire nucleotide sequence of DNA59838- 1462 is shown in Figure 239 (SEQ ID N0:344). Clone 

DNA59838-1462 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO: 344 (Figure 239). 
The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PR0844 protein 
shown in Figure 240 has an estimated molecular weight of about 12,050 daltons and a pi of about 5.45. Clone 

20 UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PR0844 may be a novel 
25 proteinase inhibitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0844 amino acid sequence and at least the following Dayhoff 
sequences, ALK1_HUMAN, P_P82403, P_P82402 ( ELAF^HUMAN and P_P60950, 

EXAMPLE 106: Isolation of cDNA Clones Encoding Human PRQ848 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. t Methods in 

35 Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2768571 , the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-I46L 
5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO: 346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PR0848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839- 146 1 

10 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0848 polypeptide suggests that it may be a 
novel sialy [transferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 

15 sequences, P_R78619 (GalNAc-alpha-2, 6-sialyltransferase), CAAG5CHICK (alpha-n-acetylgalactosamide 
alpha-2,6-sialytransferase) JlSUHSSO^UCAGe^UMANand P_R632I7 (human alpha-2, 3-sialyltransferase). 

EXAMPLE 107 : Isolation of cDNA Clones Encoding Human PRO 1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using die computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 

30 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA5984 1-1460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO: 34$). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:343 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pi of about 10.04. Clone 
DNA5984 1-1460 has been deposited with ATCC on July 1 , 1998. It is understood that the deposited clone has 
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the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoprotease family protein domain starts at about amino acid 56, and the acyltrans ferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

5 

EXAMPLE 108 :1*nlation of cDNA clones Encoding Human PRO! 153 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

10 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap** (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 

20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PRO 1 153 has a calculated molecular weight of approximately 21,540 daltons and an 
estimated pi of approximately 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 

25 ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1153 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPCJ; S46965; P_P82971; VCPHEROPHJ; EXTN_TOBAC; MLCB2548_9; 

30 ANXA_RABIT; JC5437 and SSGP_VOLCA. 

EXAMPLE 109 : Isolation of cDNA clones Encoding Human PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
3 5 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et a!., Methods in 
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Enzvmology 266:460-480 (1996)), Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program M phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyte EST clone no. 2 169375, the Incyte EST clone 2 169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
10 2909-291 1 (Figure 247; SEQ ID N0:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PRO 1 154 has a calculated molecular weight of approximately 107, 144 daltons and an 
estimated pi of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
1 5 length sequence , PRO 1 ! 54 shows sequence identity to at least the following Dayhoff designations : ABO 1 1 097_1 , 
AMPN_HUMAN, RNU76997J, 159331, GEN 14047, HSU62768J ,P_R51281 ,CET07F10J, SSU66371J, 
and AMPREHUMAN. 

EXAMPLE 110: Isolation of cDNA clones Encoding Human PROl 181 

20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

25 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 

30 sequence encompassed within the Incyte EST clone no. 2 1 86536, the Incyte EST clone 2 1 86536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-151 1. 

Clone DNA59847-1511 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 

35 The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pi of about 6.22. Analysis 
of the full-length PROl 181 sequence shown in Figure 250 (SEQ ID NO: 355) evidences the presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid ill. Clone DNA59847-I5I 1 has been deposited with ATCC on August 4, 1998 and is assigned 
5 ATCC deposit no. 203098. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO:355), evidenced homology 
between the PRO! 181 amino acid sequence and the following Dayhoff sequences: AF041083_1, P_W26579, 
RNMAGPIANJ, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C5J2, DP87_DICDI, GIU47631J and 
10 PJW7092. 

EXAMPLE ill : Isolation of cDNA clones Encoding Human PRO 1 182 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 

15 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

20 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033. 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 25 1). The 
predicted polypeptide precursor is 27 1 amino acids long (Figure 252). The full-length PRO 1 1 82 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pi of about 5.33. Analysis of 

30 the full-length PROH82 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to Clq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-I512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 

35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357), evidenced significant 
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homology between the PRO 11 82 amino acid sequence and the following Dayhoff sequences: PSPD_BOVIN, 
CL43_BOVIN, CONG_BOVIN, P_W18780, P_R45005, P_R53257 and CELEGAP7_1. 

EXAMPLE 112: Isolatior of cDNA clones Encoding Human PRO 1155 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
5 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST) databases which included public EST databases (e.g., Genfiank) and a proprietary 

EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 

Enzympjogy. 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
10 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased 
15 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 

positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID 
20 NO : 359) is 1 35 amino acids long. PRO 1 1 55 has a calculated molecul ar weight o f approximately 14, 833 daltons 

and an estimated pi of approximately 9.78. Clone DNA59849-1504 has been deposited with ATCC and is 

assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 

herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
25 length sequence, PRO 1 155 shows some amino acid sequence identity with the following Dayhoff designations: 

TKNK BOVIN; PVB19X587J; AF019049J; PJV00948; S72864; P_W00949; 162742; AF038501J; 

TKNG_HUMAN; and YATl_RHOBL. Based on the information provided herein, PROl 155 may play a role 

in providing neuroprotection and cognitive enhancement. 

30 EXAMPLE 113: Isolation of cDNA clones Encoding Human PROl 156 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 

35 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
5 The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505. 

The full length clone shown in Figure 255 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256, SEQ ID 
NO: 361) is 159 amino acids long. PRO 11 56 has a calculated molecular weight of approximately 17,476 daltons, 
10 an estimated pi of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and 
potential N-glycosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
15 alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361), revealed some homology between the PRO 1156 amino acid sequence and the following Dayhoff 
sequences: D45027J, P_R79914, JC5309, KBF2_HUMAN, AF010144J, GEN14351, S68681, P_R79915, 
ZMTAC3, and HUMCPGOJ. 

20 EXAMPLE 114 : Isolation of cDNA Clones Encoding Human PRO1098 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. f Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 

30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 

35 DNA598S4- 1459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO: 362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258), The full-length PRO 1098 protein 
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shown in Figure 258 has an estimated molecular weight of about 8,3% daitons and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO: 363, 
5 an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO: 363, and N-myristoylation sites 
appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

EXAMPLE 115 : Isolation of cDNA clones Encoding Human PROl 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

10 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

15 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program 4 *phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57959. 

In light of an observed sequence homology between the DNA57959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased 

20 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484. 

The full length clone shown in Figure 259 contained a single open reading frame with an apparent 
translarional initiation site at nucleotide positions 126-128 and ending at die stop codon found at nucleotide 
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID 

25 NO:365) is 67 amino acids long including a signal peptide at about 1 - 29 of SEQ ID NO: 365. PROl 127 has a 
calculated molecular weight of approximately 7,528 daitons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
sequencing errors are presented herein. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO: 365), revealed some homology 
between the PROH27 amino acid sequence and the following Dayhoff sequences: AF037218_48, PW09638, 
HBA HETPO, S39821, KR2EBV, CET20D3J, HCU37630J, HS193B12J0, S40012 and TRITUBCJ. 

35 
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FX /VMPLE 116: Isolation of cDNA clones En mding Human PRO 1 126 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (L1FESEQ* Incyte Pharmaceuticals, Palo Aito, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., MeWs in 
Enzvmologv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program *phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 

In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA60615-1483. 

Clone DNA60615-1483 contains a single open reading frame with an apparent translation^ initiation 
site at nucleotide positions 1 10-1 12 and ending at the stop codon at nucleotide positions 13 16-1318 (Figure 261). 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PR01126 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60. Analysis 
of the full-length PROH26 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 
from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino ac id 14 1 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367), evidenced significant 
homology between the PR01126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMR_HUMAN, MMUSMYOC3J, HS454G6J, P_R98225, RNU78105J, RNU72487J, AF035301J. 
CEELC48E7_4 and CEF1 1C3_3. 

EXAMPLE 117 : Isolation of cD NA clones Encoding Human PRQ112S 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., MfffrotiS in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the lncyte EST clone 14861 14 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-le igth protein. 
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PROl 125 has a calculated molecular weight of approximately 49,798 daltons and an 
estimated pi of approximately 9.78. Clone DNA60619-I482 has been deposited with ATCC and is assigned 

10 ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PRO 1 125 shows some sequence identity with the following Dayhoff designations: 
RCOI_NEUCR; S58306; PKWA_THECU; S76086; P_R85881; HETl_PODAN; SPU92792J; 

15 APAF_HUMAN; S76414 and S59317. 

EXAMPLE 1 18 : Isolation of cDNA clones Encoding Human PROl 186 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®. Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., M$lhoa> in, 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56748. 

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 91-93 and ending at the stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PROl 186 has a 

35 calculated molecular weight of approximately 1 1,715 daltons and an estimated pi of approximately 9.05. Clone 
DNA60621-1516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203091. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PR01186 amino acid sequence and the following Dayhoff sequences: VPRADENPO, 
LFE4_CHICK, AF034208J, AF030433J, A55035, COL RABIT, CELB0507_9, S67826J, S34665 and 
CRU73817J. 

5 

EXAMPLE 119 : Isolation of cDNA clones Encoding Human PRQ1198 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially represents the 5* ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g., GenBank), and a proprietary 

10 EST database (LIFESEQ 0 , Incyte Pharmaceuticals; Palo Alto, CA), using the computer program BLAST or 
BLAST2 [Altschul et al. f Methods in Enzvmology. 266:460-480 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, University of 
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 

15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO: 372) contained a single open 

20 reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at the stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO:373), is 229 amino acids long. PROl 198 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9. 17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 31 protein at about amino acids 

25 142 to about 175. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO: 373), revealed some homology 
between the PROl 198 amino acid sequence and the following Dayhoff sequences: ATF6H1 16, UCRIRAT, 
TOBSUP2NTJ. RCUERF3J, AMU88186J, P_W22485, S56579, AF040711J, DPP4_PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203090. 

EXAMPLE 120: Isolation of cDNA clones Encoding Human PROl 158 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
35 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (UFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
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homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzyjn2l2gy_ 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DNA57248 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2640776, the Incyte EST clone 2640776 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507. 

The full length clone shown in Figure 269 contained a single open reading frame with an apparent 

10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long. PR01158 has a calculated molecular weight of approximately 13,1 13 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylauon site at 

15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is 
assigned ATCC deposit no. 209975. 

An analysis of the Dayhoff database (version 35,45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PROH58 amino acid sequence and the following Dayhoff sequences: ATAC00310510F18A8.10, 

20 P_R85151, PHS2_SOLTU, RNMHCIBACJ. RNA1FMHCI, 168771, RNRT1A10GJ, PTPA_HUMAN, 
HUMGACA_1, and CHKPTPAJ. 

EXAMPLE 121: Isolation of cDNA clones Encoding Human PRO 1 159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57221. 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA60627-1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 92-94 and ending at the stop codon ai nucleotide positions 362-364 (Figure 271). The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PROl 159 protein shown 
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pi of about 10. 13. Analysis of 
the full-length PROU59 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41 . Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant 
10 homology between the PROl 159 amino acid sequence and the following Dayhoff sequences: AF0 16494_6, 
AF036708_20, DSSCUTEJ, D89100J, S28060, MEFAXENLA, AF020798J2,G70065, E64423,JQ2005. 

EXAMPLE 122 : Isolation of cDNA clones Encoding Human PRO 1124 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

20 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 

25 and the cDNA insert was obtained and sequenced, It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-1481. 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379) 

30 is 9 19 amino acids long. PRO 1 124 has a calculated molecular weight of approximately 101 ,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629- 1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It is understood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

35 Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PROl 124 shows 

significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROl 124: ECLC_BOVIN, 
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AF001261J, P_W06548, SSC6A10J. AF004355J, S76691, AF017642. BYU06866_2, CSAJDICDI and 
SAU47139_2. 

EXAMPLE 123: Isolation of cDNA clones Encodi ng Human PR01287 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
5 was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 
then compared to various EST databases including public EST databases (e.g. , GenBank), and a proprietary EST 
database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The 
comparison was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods jn 
Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 
10 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus 
sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1 ) to identify by PCR 
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full- 
15 length coding sequence for PRO 1287. Forward and reverse PCR primers generally range from 20 to 30 
nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences 
are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 
sequence is greater than about l-l.Skbp. In order to screen several libraries for a full-length clone, DNA from 
the libraries was screened by PCR amplification, as per Ausubel et al. , Current Protocols in Molecular Biology, 
20 supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -CTCGGGG A A AGGGACTTG ATGTTGG-3 ' (SEQ ID NO:382) 
reverse PCR primer 1 S'-GCGAAGGTGAGCCTCTATCTCGTGCCO' (SEQ ID NO:383) 
25 reverse PCR primer 2 S'-CAGCCTACACGTATTGAGGO* (SEQ ID NO:384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
^yhvjHi^tinn probe 

5 ' -CAGTCAGTAC AATCCTGGC ATAAT AT ACGGCC ACCATGATGC AGTCCC-3 * (SEQ ID NO:385). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PRO 1287 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 

35 reagents such as those from Invitrogen, San Diego, C A. The cDNA was primed with oligo dT containing a NotI 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
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pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al. , Science . 253 ; 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01287 (designated herein as DNA61755-1554 [Figure 275, SEQ ID NO:380J) and the derived protein 
sequence for PR01287. 

5 The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 

full length clone contained a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,35 1 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PRO 1287 sequence shown 

10 in Figure 276 (SEQ ID NO:381) evidences the presence of the following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 3 15 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA6 1755- 1554 has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203 112. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

15 alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO:381), evidenced significant 
homology between the PRO 1287 amino acid sequence and the following Dayhoff sequences: CET24D11, 
EZRI_BOVTN, GGU19889J, CC3_YEAST, S74244, NALSMOUSE, MOES_PIG, S28660, S44860 and 
YNA4CAEEL. 

20 EXAMPLE 124 : Isolation of cDNA clones Encoding Human PRQ1312 

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
synthesized for use as probes to isolate a clone of the full-length coding sequence for PRO 1 3 12. 

The full length DNA61873-1574 clone shown in Figure 277 (SEQ ID NO:386) contained a single open 

25 reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at die stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID NO:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pi of approximately 6.26. Other features include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 14 1 - 1 60, and potential N-glycosylation sites at about amino acids 

30 76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID NO:387), revealed some homology 
between the PRO 13 12 amino acid sequence and the following Dayhoff sequences: GCINTALPH1, 
GIBMUCIAJ, P_R96298 f AFO014O6J, PVU88874 1, P_R85151, AF041409J, CELC50F2 7, C45875, 
35 and ABOO9510 21. 

Clone DNA61873-1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132. 
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EXAMPLE 125 : Isolation of cDNA clones Encoding Human PR01192 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PR01192. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5* -CCGAGGCCATCTAGAGGCCAGAGC-3 * (SEQ ID NO:390) 
reverse PCR primer : 5 ' - AC AGGC AGAGCC AATGGCC AG AGC-3 ' (SEQ ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
10 DNA35924 sequence which had the following nucleotide sequence: 
hybridization probe: 

5 * -G AG AGG ACTGCGGG AGTTTGGGACCTTTGTGC AGACGTGCTC ATG-3 * (SEQ ID NO:392). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 192 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROH92 designated herein as DNA62814-1521 and shown in Figure 279 (SEQ ID NO:388); and the derived 

protein sequence for PR01192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROl 192 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814- 

1521 contains a single open reading frame with an apparent iranslational initiation site at nucleotide positions 

121-123 and an apparent stop codon at nucleotide positions 766-768. The predicted polypeptide precursor is 215 

amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 

amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at 
25 about amino acids 39-42 and 1 18-121 ; and homology with myelin P0 proteins at about amino acids 27-68 and 

99-128 of Figure 280. The full-length PROl 192 protein shown in Figure 280 has an estimated molecular weight 

of about 24.484 daltons and a pi of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the rull-length sequence shown in Figure 280 (SEQ ID NO:389), revealed homology 
30 between the PROl 192 amino acid sequence and the following Dayhoff sequences: GEN12838, MYP0HUMAN. 

AF049498J, GEN14531, P_W14146, HS46KDA1, CINB RAT, OX2G_RAT, D87018J, and D86996J. 
Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC 

deposit no. 203093. 

35 
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EXAMPLE 126 : Isolation of cDNA clones Encoding Human PRO1160 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA4065O. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
5 PRO 1160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5 ' -GCTCCCTG ATCTTC ATGTC ACC ACC-3 ' (SEQ ID NO:395) 
reverse PCR primer 5 ' -C AGGG AC AC ACTCTACC ATTCGGG AG-3 ' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
frvhridization probe 

S'-CCATCnTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTC^' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 160 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROU60 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO: 393]) and the derived protein 

sequence for PROl 160. 

20 The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-I509 contains a single open reading frame with an apparent translationai initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pi of about 4.37. Analysis of the full-length 

25 PROl 160 sequence shown in Figure 282 (SEQ ID NO:394) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant 
homology between the PRO 11 60 amino acid sequence and the following Dayhoff sequences: B30305, 
GEN13490, 153641, S53363, HA34_BRELC, SP96_DICDI, S36326, SSU51197J0, MUC1_XENLA, 
TCU32448J and AF000409J. 

35 EXAMPLE 127 : of cPNA Clones, fingoflinp! Htiffian PRO U 87 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identity existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 358563, the Incyte EST clone 358563 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 

10 The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO: 399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO: 399. 

15 PRO 1187 has a calculated molecular weight of approximately 12,925 daJtons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO: 399), revealed some sequence 
identity (and therefore some relation) between the PRO 1 187 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBXl , CELF41G3_9, AMPGSTRLI, HSBBO VHERL2 , LEEXTEN10J, 
AF029958 1 and P_W04957. 

25 EXAMPLE 128: Isolation of cDNA clones Encoding Human PROl 185 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program tl phrap w (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 328441 1 , the Incyte EST clone 328441 1 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

The full length DNA62881-15 15 clone shown in Figure 285 contained a single open reading frame with 
an apparent translational initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ ID NO:400). The predicted polypeptide precursor (Figure 286, 
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:40i . 
5 PROH85 has a calculated molecular weight of approximately 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
10 identity between the PROH85 amino acid sequence and the following Dayhoff sequences: TUPI YEAST, 
AF041382J, MAOM_SOLTU, SPPBPHU9J.I41024, EPCPLCFAILJ, HSPLECJ, YKU_CAEEL, 
A44643, TGU65922J. 

EXAMPLE 129 : Isolation of cDfMA clones Eflcoflng Human PRO 1345 
15 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1345. 

20 PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 , -CCTGGTTATCCCCAGGAACTCCGAC-3' (SEQ ID NO:404) 

reverse PCR primer 5'-CTCTTGCTGCTGCGACAGGCCTC-3' (SEQ ID NO:405) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 

sequence which had the following nucleotide sequence 

25 hybridization probe 

5 ' -CGCCCTCC AAGACTATGGT AA A AGGAGCCTGCC AGGTGTC AATG AC-3 ' (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 1 345 gene using the probe oligonucleotide and one of the PCR primers. RNA 

30 for construction of the cDNA libraries was isolated from human breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402]) and the derived protein 
sequence for PR01345. 

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone 
35 DN A64852- 1589 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 7-9 or 34-36 and ending al the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PRO 1345 protein shown in Figure 
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288 has an estimated molecular weight of about 23, 190 daltons and a pi of about 9.40. Analysis of the full- 
length PR01345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 31 or from about amino acid 10 to about amino acid 
31 and a C-type lectin domain signature sequence from about anino acid 176 to about amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18, 1998 and is assigned ATCC deposit no. 203127. 
5 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403), evidenced significant 
homology between the PR01345 amino acid sequence and the following Dayhoff sequences: BTU22298_1, 
TETN_CARSP, TETNHUMAN, MABA_RAT, S34198, PW13144, MACMBPAJ, A46274, PSPDRAT 
AND P_R32188. 

10 

EXAMPLE 130: Isolation of cDNA clones E ncoding Human PRO 1245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

15 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap M (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

20 obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884-1527. 

25 The full length clone shown in Figure 289 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18. 
PRO 1245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 

30 approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYATHETH, GENII 167, 
MTV044 4, AB011151J, RLAJ2750J, SNELIPTRAl, S63624, C28391, A37907, and S14064. 

35 Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC 

deposit no. 203155. 
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EXAMPLE 131 : Isolation of cD NA clones Enco ding Human PRO 1358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul ei ah, Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 

1 0 encompassed within the Incyte EST clone no. 887 1 8 , the Incyte EST clone 887 1 8 was purchased and the cDN A 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-1612. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 

15 positions 1418 through 1420 (Figure 291; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292, 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PRO 135 8 has a calculated molecular weight of approximately 50,719 daltons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131. 

20 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO:410), revealed sequence identity 
between the PR01358 amino acid sequence and the following Dayhoff sequences: P_W07607, AB000545_1, 
AB000546J, A1ATRAT, AB0l5164_l,P_P5002i 1 COTR CAVPO, and HAMHPPl. The variants claimed 
in this application exclude these sequences. 

25 

EXAMPLE 132: Isolation of cDNA clones EncoflnR Human PROl 195 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

30 EST DNA database (LIFESEQ 0 , Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah. Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

35 obtained therefrom is herein designated DNA55716. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
translations! initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:4U). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412) 
5 is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PROl 195 has 
a calculated molecular weight of approximately 17,277 daltons and an estimated pi of approximately 5 . 33 . Clone 
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
10 identity between the PRO! 195 amino acid sequence and the following Dayhoff sequences: MMU28486_1, 
AF044205J, P_W31186, CELK03C7J, F69034, EF1A_METVA, AF024540J, SSU90353J, 
MRSP_STAAU and P_R97680. 

EXAMPLE 133: Isolation of cDNA clones Encoding Human PRO 1270 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ® Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. t Methods in 

20 Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program M phrap* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57951. 

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 

30 The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO 1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pi of about 5.71 . Analysis 
of the full-length PRO1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 

3 5 74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 
homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699J, 
S49589, FIBAPARPA, FIBBHUM AN , PR47 1 89, AF004326 1 , DRTEN ASCN_ 1 , AF004327_ 1 , P_W0141 1 
and FIBG_BOVIN. 

5 

AMPLE 134 : Isolation of cDNA clones Enc oding Human PRQ1271 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

10 EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA625350, the Merck EST clone AA625350 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538. 

20 Clone DNA66309-1538 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 7 18-720 (Figure 297) . The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PRO 1271 protein shown 
in Figure 298 has an estimated molecular weight of about 21 ,531 daltons and a pi of about 8.99. Analysis of 
the full-length PRO 1271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 

25 following: a signal peptide from about amino acid 1 to about amino acid 31 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 

30 homology between the PR01271 amino acid sequence and the following Dayhoff sequences: S57180, S63257, 
AGA1YEAST, BPU43599J, YS8A_CAEEL, S67570, LSU54556_2, S70305, VGLX_HSVEB, and 
D88733J. 

EXAMPLE 135 : Isolation of cDNA clones Encoding Human PRQ1375 
35 A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
databases (e.g., GenBank, Merck/Wash. U.), and a proprietary EST database (LIFESEQ* Incyte 
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Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
[Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the extracellular domain 
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
5 Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein "DNA67003 n . 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID N0:417) was identified in a 
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PRO 1375 and 
10 the derived protein sequence for PRO 1375. 

The entire coding sequence of PRO 1375 is shown in Figure 299 (SEQ ID NO:4 17). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translations! initiation site at nucleotide positions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID N0:417. The predicted 
polypeptide precursor is 198 amino acids Long. The transmembrane domains are at about amino acids 1 1-28 
15 (type II) and 103-125 of SEQ ID N0:418. Clone DNA67004-1614 has been deposited with ATCC and is 
assigned ATCC deposit no. 203115. The full-length PR01375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pi of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:4 1 8). revealed sequence identity 
20 between the PROl 375 amino acid sequence and the following Dayhoff sequences: AF026198 5, CELR12C 12_5, 
S73465, Y011_MYCPN, S64538J, PJ>8150, MUVSHPO10J, VSH_MUMPL and CVU59751 J. 

EXAMPLE 136 ; Isolation of cDNA clones Encoding Human PRO 1385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seatde, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-16I0. 
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Clone DNA68869-1610 contains a single open reading frame with an apparent translations! initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 410-4 12 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PRO 1385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PR01385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid I to about amino acid 28, and glycosylamiiioglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8J, 
LMNACHRA1J, HXD9HUMAN, CHKCMLFl, HS5PP34_2. DMDRINGJ, A37I07J, 
MMLUNGENEJ, PUM_DROME and DMU25I17J. 

15 EXAMPLE 137 : Isolation of cDNA clones Encoding Human PRQ1387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®. Incyte Pharmaceuticals, Palo Alio, CA) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
finrvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or io some cases 90) 
or greater thai did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green. University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56259. 

25 In light of an observed sequence homology between the DNA56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PRO 1387 protein 
shown in Figure 304 has an estimated molecular weight of about 44,339 daltons and a pi of about 7.10. Analysis 
of the full-length PR01387 sequence shown in Figure 304 (SEQ ID N0:422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

35 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid 
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379, and amino acid sequence blocks having homology to myelin pO protein from about amino acid 210 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), usinga WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant 
5 homology between the PR01387 amino acid sequence and the following Dayhoff sequences: P_W36955, 
MYP0_HETFR, HS46KDA_1, AF049498J, MYO0_HUMAN. AF030454J, A53268, SHPTCRAJ, 
PW14146 and GEN 12838. 

EXAMPLE 138: Isolation of cD NA clones Encoding Human PRO 1384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01384. 
PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5'-TGCAGCCCCTGTGACACAAACTGG-3' (SEQ ID NO:425) 

reverse PCR primer 5 '-CTGAGATAACCGAGCCATCCTCCCAC-3 ' (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 

sequence which had the following nucleotide sequence: 

hybridization probe 

20 S'-GGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAG-S' (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01384 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01384 (designated herein as DNA71 159-1617 [Figure 305, SEQ ID NO:423]; and the derived protein 

sequence for PR01384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423) . Clone DN A7 1 1 59- 
1617 contains a single open reading frame with an apparent translational initiation she at nucleotide positions 

30 182-184 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potential N-glycosy lation sites at about amino acids 68-7 1 , 120- 123, and 1 34- 1 37 . 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology 
between the PR01384 amino acid sequence and the following Dayhoff sequences: AF054819J, HSAJ1687 1, 
AF0O9511_l,AB0lO71OJ,GEN13595,HS^ 
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and NK13_RAT. 

r.lnne DNA71 159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139 : Use of PRO a « » hvhri Nation probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 
a probe to screen for homologous DNAs (such as those encoding narurally-occurring variants of PRO) in human 
tissue cDNA libraries or human tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
solution of 50% formaraide, 5x SSC, 0. 1 % SDS, 0.1 % sodium pyrophosphate, 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. lx SSC and 0.1% SDS at 42°C. 

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can 
then be identified using standard techniques known in the art. 

fx ample 140: Expression of Pfi Q in E. coli 

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers 
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is 
pBR322 (derived from E. coli\ see Bolivar et al.. Geng, 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylaied. The PCR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixture is then used to transform a selected E. coli strain using the methods described in 
Sambrook et al. , sjrora, Transfonnants are identified by their ability to grow on LB plates and antibiotic resistant 
colonies are then selected, Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are 
then grown to a desired optical density, during which the expression promoter is turned on. 

After culruring the cells for several more hours, the cells can be harvested by centrifugation. The cell 
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in E. coli in a poly-His lagged form, using the following procedure. The DN A 
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
5 column, and proteolytic removal with enterokinase. The PCR- amplified, poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB containing 50 mg/ml 
carbenicillin at 30 °C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH 4 ) 2 S0 4 , 0.71 g sodium citrate°2H20, 1.07 g KC1, 5.36 g Difco 

10 yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3, 0.55% (w/v) 
glucose and 7 mM MgS0 4 ) and grown for approximately 20-30 hours at 30 °C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 

15 guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final 
concentrations of 0. IM and 0,02 M, respectively, and the solution is stirred overnight at 4°C. This step results 
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The 

20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nra 
using the calculated extinction coefficient based on its amino acid sequence. 

25 The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris, pH 8.6, 0,3 M NaCl, 2.5 M urea, 5 mM cysteine. 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the 

30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1% 
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 

35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin 
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from the samples. 
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a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile filtered. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



FX AMPLE 141 : Expression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector. 

Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO 
DNA using ligation methods such as described in Sambrook et al. , supra. The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
15 grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics, About 10 M g pRK5-PRO DNA is mixed with about I /ig 
DNA encoding the VA RNA gene [Thimmappaya et al., £eJl, li :543 < 1982 » dissolved in 500 pi of I mM 
Tris-HCl, 0. 1 mM EDTA, 0.227 M CaCl 2 . To this mixture is added, dropwise, 500 p\ of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1,5 mM NaP0 4 , and a precipitate is allowed to form for 10 minutes at 25°C. The 
20 precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture 
medium (alone) or culture medium containing 200 jiCi/ml )5 S-cysteine and 200 M Ci/ml 33 S-methionine. After 
25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filteT, and loaded onto a 15 % 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate 
30 method described by Somparyrac et al., Proc. Natl. Acad. Sci.. 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 ^g pRK5-PRO DNA is added. The cells are first concentrated from 
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell 
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture 
medium, and re-introduced into the spinner flask containing tissue culture medium, 5 Mg/ml bovine insulin and 
35 0. 1 /ig/ml bovine transferrin. After about four days, the conditioned media is cemrifuged and filtered to remove 
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected 
method, such as dialysis and/or column chromatography. 
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In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be iransfected into 
CHO cells using known reagents such as CaPO* or DEAE-dextran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
35 S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40 

1 0 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni 2+ -chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO 

15 cells by another stable expression procedure. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgG 1 constant region sequence containing the hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 

20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology. Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3* 
of the DNA of interest to allow the convenient shuttling of cDNA s. The vector used expression in CHO cells 
is as described in Lucas et ah, Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 

25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells 
using commercially available transfection reagents Superfect* (Quiagen), Dosper° or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10' 7 cells are frozen 

30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by 
vortexing. The contents are pipetted into a centrifuge rube containing 10 mLs of media and cemrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
^m filtered PS20 with 5% 0.2 /2m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 

35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 raL selective growth medium and incubated at 37<?C. After another 2-3 days, 250 mL, 500 mL 
and 2000 mL spinners are seeded with 3 x 10 5 cells/mL. The cell media is exchanged with fresh media by 
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centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 
3L production spinner is seeded at 1 .2 x 10 6 cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commence! On day 2, the spinner is sampled, the 
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% ami foam (e.g., 35% 
5 polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70% , the cell culture is harvested by centrifugation and filtering through a 0.22 filter. The filtrate was either 
stored at 4°C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 

10 purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes. 0. 14 M NaCl and 4% 

15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80*0. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer, pH 6.8, After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 

20 ml fractions into tubes containing 275 of 1 M Tris buffer, pH 9. The highly purified protein is subsequently 
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed 
by SDS polyacrylarmde gels and by N-terrainal amino acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

25 EXAMPLE 142 : Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DN A encoding PRO and the promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain AB1 10, can then be transformed with the expression plasmids described 
above and cultured in selected fermentation media. The transformed yeast supemaiants can be analyzed by 

35 precipitation with 10% trichloroacetic acid and separation by SDS -PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the 
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



5 EXAMPLE 143: Expression of PRO in Baculovirus-infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus 
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 

10 as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of 
PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into ihe expression vector. 

15 Recombinant baculovirus is generated by co-transfecting the above piasmid and BaculoGold™ virus 

DNA (Pharraingen) into Spodoptera fiugiperda rSf9') cells (ATCC CRL 1711) using lipofectin (commercially 
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 
al„ Baculovirus expression vector s: A Laboratory Manual. Oxford: Oxford University Press (1994). 

20 Expressed poly-his tagged PRO can then be purified, for example, by Ni 2+ -cheiate affinity 

chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by 
Rupert et al., Nature. 175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mLHepes, pH 7.9; 12.5 mM MgCl 2 ; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KC1), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-foid 

25 in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 urn 
filter. A Ni 2+ -NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 
ml, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A 280 with loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 

30 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
A 2fl o baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni 2+ -NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His, 0 -tagged PRO are 
pooled and dialyzed against loading buffer. 

35 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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EXAMPLE 144 : Preparation of Antibodies th at Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 
Techniques for producing the monoclonal antibodies are known in the art and are described, for 
instance, in Goding, supra . Immunogens that may be employed include purified PRO, fusion proteins containing 
PRO, and cells expressing recombinant PRO on the celi surface. Selection of the immunogen can be made by 
5 the skilled artisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and 
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 
10 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody uter has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
15 are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of 
20 "positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
25 affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145 : Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the an 
of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide 
30 is purified by immunoaffiniry chromatography using antibodies specific for the PRO polypeptide of interest. In 
general, an immunoaffiniry column is constructed by covalently coupling the anti-PRO polypeptide antibody to 
an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 

5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO r>olypeptide-contaiiiing preparation is passed over the imiuunoaffinity column, and the 
column is washed under conditions that allow the preferentiaJ absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 

10 antibody/PRO polypeptide binding (e.g. , a low pH buffer such as approximately pH 2-3 , or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146: Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO porypeptide-associated disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypeptideyceU complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the an. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 

5 

EXAMPLE 147 : Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e. , a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 

10 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f. , Hodgson, 
Bio/Technoloev. 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-mhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 

15 to elucidate the structure and to determine active stte(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide- like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistry. 11:7796-7801 (1992) or which act as 

20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda et ai , J. Biochem,, 111:742-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti- idiotypic 

25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 

30 to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 
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of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): 



ms2 



5 


Material 


ATCC Dep. No. 


Pffposit Dai< 




DNA16422-1209 


209929 


June 2, 1998 




DNA16435-1208 


209930 


June 2, 1998 




DNA21624-1391 


209917 


June 2, 1998 




DNA23334-1392 


209918 


June 2, 1998 


10 


DNA26288-1239 


209792 


April 21, 1998 




DNA26843-1389 


203099 


August 4, 1998 




DNA26844-I394 


209926 


June 2, 1998 




DNA30862-1396 


209920 


June 2, 1998 




DNA35680-1212 


209790 


April 21, 1998 


15 


DNA40621-1440 


209922 


June 2, 1998 




DNA44161-1434 


209907 


May 27, 1998 




DNA44694-1500 


203114 


August 11, 1998 




DNA45495-1550 


203156 


August 25, 1998 




DNA47361-1154 


209431 


November 7, 1997 


20 


DNA47394-1572 


203109 


August 11, 1998 




DNA48320-1433 


209904 


May 27, 1998 




DNA48334-1435 


209924 


June 2. 1998 




DNA48606-1479 


203040 


July 1, 1998 




DNA49141-1431 


203003 


June 23, 1998 


25 


DNA49142-1430 


203002 


June 23, 1998 




DNA49 143- 1429 


203013 


June 23, 1998 




DNA49647-1398 


209919 


June 2, 1998 




DNA498 19-1439 


209931 


June 2, 1998 




DNA49820-1427 


209932 


June 2, 1998 


30 


DNA49821-1562 


209981 


June 16, 1998 




DNA52192-1369 


203042 


July 1, 1998 




DNA52598-1518 


203107 


August 11, 1998 




DNA53913-1490 


203162 


August 25, 1998 




DNA53978-1443 


209983 


June 16, 1998 


35 


DNA53996-1442 


209921 


June 2, 1998 




DNA56041-1416 


203012 


June 23, 1998 




DNA56047-1456 


209948 


June 9, 1998 




DNA56050-1455 


203011 


June 23, 1998 




DNA561 10-1437 


203113 


August 11, 1998 


40 


DNA561 13-1378 


203049 


July I, 1998 




DNA56410-1414 


209923 


June 2, 1998 




DNA56436-1448 


209902 


May 27, 1998 




DNA56855-1447 


203004 


June 23, 1998 




DNA56859-1445 


203019 


June 23, 1998 


45 


DNA56860-1510 


209952 


June 9, 1998 




DNA56865-1491 


203022 


June 23, 1998 




DNA56866-1342 


203023 


June 23, 1998 




DNA56868-1209 


203024 


June 23, 1998 




DNA56869-1545 


203161 


August 25, 1998 


50 


DNA56870-1492 


209925 


June 2, 1998 




DNA57033-1403 


209905 


May 27, 1998 




DNA57037-1444 


209903 


May 27, 1998 




DNA57129-1413 


209977 


June 16, 1998 
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30 



DNA57690-1374 


209950 


June 9, 1998 


DNA57693-1424 


Oi/\0 

203008 


June 23 , 1 998 


nNA.S7(S94-l341 


203017 


June 23, 1998 


DNA57695-1340 


203006 


June 23, 1998 


DNA57699-1412 


203020 


I. ,_ _ >-\ 'S IAAQ 

June 23, 1998 


DNA57702-1476 


209951 


June 9, 1998 


DNA57704-1452 


209953 


june 9, 1998 


DNA57708-I411 


203021 


June 23, 1998 


DNA57710-1451 


203048 


July 1, 1998 


DNA577 11-1501 


2U3U47 


July 1, 1998 


DNA57827-1493 


203045 


July 1, 1998 


DNA57834-1339 


209954 


June 9, 1998 


DNA57836-1338 


203025 


June 23, 1998 


DNA57838-1337 


203014 


June 23, 1998 


DNA57844-1410 


203010 


June 23, 1998 


DNA5872 1-1475 


203110 


August 11, 1998 


DNA58723-1588 


203133 


August 18, 1998 


DNA58737-I473 


203136 


August 18, 1998 


DNA58743-1609 


203154 


August 25. 1998 


DNA58846-1409 


209957 


June 9, 1998 


DNA58848-1472 


209955 


June 9, 1998 


DNA58849-1494 


209958 


June 9, 1998 


DNA5 8850-1495 


209956 


June 9, 1998 


DNA5 8853-1423 


203016 


June 23, 1998 


DNA58855-1422 


203018 


June 23, 1998 


DNA59205-1421 


203009 


June 23, 1998 


DNA5921 1-1450 


209960 


June 9, 1998 


DNA59213-1487 


209959 


June 9, 1998 


DNA59214-1449 


203046 


July 1, 1998 


DNA59215-1425 


209961 


June 9, 1998 


DNA59220-1514 


209962 


June 9, 1998 


DNA59488-1603 


203157 


August 25, 1998 


DNA59493-1420 


203050 


July 1, 1998 


DNA59497-1496 


209941 


June 4, 1998 


DNA59588-1571 


203106 


August 11, 1998 


DNA59603-1419 


209944 


June 9, 1998 


DNA59605-1418 


203005 


June 23, 1998 


DNA59606-1471 


209945 


June 9, 1998 


DNA59607-1497 


209957 


June 9, 1998 


DNA59609-1470 


209963 


June 9, 1998 


DNA59610-1559 


209990 


t 1/1 

June 16, 1998 


DNA59612-1466 


209947 


June 9, 1998 


DNA59613-1417 


203007 


T o *\ t AAA 

June 23, 1998 


DNA59616-1465 


209991 


June 16, 1998 


DNA59619-1464 


203041 


July 1, 1998 


DNA59620-1463 


209989 


June 16, 1998 


DNA59625-1498 


209992 


June 17, 1998 


DNA59767-1489 


203108 


August 11, 1998 


DNA59776-1600 


203128 


August 18, 1998 


DNA59777-1480 


203111 


August 11, 1998 


DNA59820-1549 


203129 


August 18, 1998 


DNA59827-1426 


203089 


August 4, 1998 


DNA59828-1608 


203158 


August 25, 1998 


DIM A59838- 1462 


209976 


June 16, 1998 


DNA59839-1461 


209988 


June 16, 1998 


DNA5984 M460 


203044 


July 1, 1998 


DNA59842-1502 


209982 


June 16, 1998 
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10 



15 



20 



25 



30 



35 



40 



45 



D N A59846- 1 503 


^aaato 


junc io, [Wo 


yw T A rAO A ^ 1C1 1 

DNA59847-1511 


oataoq 

zojuvb 


AUgUSl *t, I770 


DNA59848-1512 


iA'Saqo 


Aitmicf A 1 QOff 

August iyyo 


DN A59 84V- 1 5W 




| 1in A ]£ IQOfi 

j unc 1 0 1 1 77o 


T%Vt A (AO 1 C/VC 

DN A59853- 1 505 


2UWoj 


JliUC 10, I77O 


DN A5V854- 145 V 




Inn** Irt IQQfi 

June 10, 1770 


DNA60283-1484 


iUiU4J 


July 1, l77o 


DNA60615-1483 


0AOO2A 
ZU770U 


Inn** 1 A lOOfi 
JUI1C 10, 1770 


f\KI A £T\C 1 ft 1/1 

DN AoOo 1 y- 1482 




June 10, I770 


DNA60621-1516 


OAT AO. 1 


AugUSt 4, ItVo 


DNA60622- 1 525 


OA"3AQA 


August 4, iyya 


DN A60625- 1 507 




Tuna 1 £ 1 OAQ 

June 10, I770 


DNA60627-1508 


OA'S Am 


AUgUSt 4, lWo 


DNA60629-1481 


209V79 


June 10, 1998 


DNA61755-1554 


2031 12 


A ■ mix.* 1 1 1 ADO 

AUgUSt 11, IW0 


DNA61 873-1574 


203132 


AUgUSt 18, 1998 


DNA62814-1521 


203093 


August 4, 1998 


DNA62872-1509 


203100 


August 4, ivy 8 


DNA62876-1517 


203095 


August 4, 1998 


DNA62881-1515 


203096 


August 4, 1998 


V T A ^ At**'* 1 fen 

DNA64 852- 1589 


203127 


August 18, 1W8 


DNA64884-1527 


203155 


August 25, 1998 


DNA64890-1612 


203131 


August 18, 1998 


DNA65412-1523 


203094 


August 4, 1998 


DNA66308-1537 


203159 


August 25, 1998 


DNA66309-1538 


203235 


September 15, 1998 


DNA67004-1614 


203115 


August 11, 1998 


DNA68869-1610 


203164 


August 25, 1998 


DNA68872-1620 


203160 


August 25, 1998 


DNA71 159-1617 


203135 


August 18, 1998 



These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1 . 14 with particular reference to 
886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that ar* fyncrimwity equivalent are within the scope of this invention, The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 
(SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30) t Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID 
NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ 
ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99). Figure 57 (SEQ ID NO: 103), Figure 64 
(SEQ ID NO:l 13), Figure 66 (SEQ ID NO: 115), Figure 68 (SEQ ID NO: 1 17), Figure 70 (SEQ ID NO: 119), 

10 Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID 
NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure 90 
(SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO: 158), Figure 97 (SEQ ID NO: 160), 
Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), Figure 105 (SEQ 
ID NO:171), Figure 109 (SEQ ID NO: 175), Figure 111 (SEQ ID NO: 177), Figure 113 (SEQ ID NO:179), 

15 Figure 115 (SEQ ID NO: 181), Figure 1 17 (SEQ ID NO: 183), Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ 
ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO: 199), Figure 129 (SEQ ID NO:201), 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID N0:213), Figure 144 (SEQ ID NO:216), 
Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ 

20 ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), 
Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:24I) t Figure 165 (SEQ ID NO:246), Figure 167 (SEQ 
ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 

25 Figure 190 (SEQ ID NO:272). Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ 
ID NO:278), Figure 198 (SEQ ID NO;281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ 
ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), 
Figure 218 (SEQ ID NO:30I), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), Figure 228 (SEQ 

30 ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340). Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ 
ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ 
ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), 

35 Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), 
Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), Figure 286 (SEQ 
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ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), 
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ 
ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
5 sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 
ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 1 1 (SEQ ID NO: 19), Figure 14 (SEQ 
ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), Figure 24 
(SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), Figure 
33 (SEQ ID NO:56) r Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 

10 Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID 
NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 11 2). Figure 65 (SEQ ID NO: 1 14), Figure 67 
(SEQ ID N0:116), Figure 69 (SEQ ID N0:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO: 128), 
Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), Figure 84 (SEQ ID 
NO:I47), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO: 155), Figure 94 

15 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 (SEQ ID NO: 166), 
Figure 102 (SEQ ID NO: 168). Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID NO: 174), Figure 1 10 (SEQ 
ID NO; 176), Figure 112 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180), Figure 116 (SEQ ID NO: 182), 
Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), Figure 126 (SEQ 
ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 

20 Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ 
ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), 
Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID NO:249), Figure 170 (SEQ 

25 ID NO:252). Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), 

30 Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ 
ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), 
Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318), Figure 232 (SEQ 
ID NO:325), Figure 234 (SEQ ID NO:333). Figure 237 (SEQ ID NO:339), Figure 239 (SEQ ID NO:344), 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), Figure 247 (SEQ 

35 ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ 
ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), 



502 



WO 99/63088 



PCT/US99/12252 



Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 295 (SEQ ID NO:413), 
Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
5 ID NO:421) and Figure 305 (SEQ ID NO:423). 

3 . The nucleic acid of Claim 1 , wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of the sequence shown in Figure 1 (SEQ 
ID NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID 

10 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40). Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO: 72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

15 NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:i23), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO:155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

20 NO:174), Figure 110 (SEQ ID NO:176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO:180), Figure 
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

25 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247) t Figure 168 (SEQ ID 
NO:249). Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:26I), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 

30 NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID N0.269), Figure 189 (SEQ ID NO:271), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID N0.277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID N0:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 

35 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350), Figure 247 (SEQ ID NO:352). Figure 249 (SEQ ID NO:354), Figure 25 1 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366). Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
5 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 
295 (SEQ ID N0:413), Figure 297 (SEQ ID N0:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

10 .4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 

under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1 . 

15 6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 

transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

20 8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. coli. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

25 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

30 12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 

selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ 
ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID NO:4 1), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 

35 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73). 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 113), Figure 66 (SEQ ID NO:l 15), Figure 68 

504 



WO 99/63088 



PCT/US99/12252 



(SEQ ID NO:l 17), Figure 70 (SEQ ID NO: 119), Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129), 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146). Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 151), Figure 90 (SEQ ID NO: 153), Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO: 167), 
Figure 103 (SEQ ID NO: 169), Figure 105 (SEQ ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 1 1 1 (SEQ 
5 ID NO:177), Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), 
Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ 
ID NO: 199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), 
Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:21 1), Figure 141 (SEQ 
ID NO;213)» Figure 144 (SEQ ID NO:216) t Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221). 

10 Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225). Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276). Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), Figure 200 (SEQ 
ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295). Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349). Figure 246 (SEQ ID NO:35l), Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375). Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379). Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389). Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399) f Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412). Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2, 

35 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region oi an immunoglobulin. 

5 

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 
10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21 . An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
15 which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 

NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22). Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32). Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

20 40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NOtl 12), Figure 65 (SEQ ID 
NO: 1 14), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 1 18). Figure 7 1 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145). 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 

25 NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188). Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 

30 132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 

35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257) ( Figure 
177 (SEQ ID NO:259). Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265). Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:27I), Figure 



506 



WO 99/63088 



PCT/US99/12252 



191 (SEQ ID NO:273). Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280). Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294). Figt >re 2 1 3 (SEQ ID NO : 296) , Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344). Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 

10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 
281 (SEQ ID NO:393). Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 

15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure I (SEQ ID 
NO:l), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40). Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID NO:51). Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 NO: 114), Figure 67 (SEQ ID NO: 116), Figure 69 (SEQ ID NO: 118), Figure 71 (SEQ ID NO: 123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188), Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196), Figure 126 (SEQ ID NO: 198). Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208). Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224). Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235). Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245). Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
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NO:249), Figure 170 (SEQ ID NO:252) t Figure 173 (SEQ ID NO:255) ? Figure 175 (SEQ ID NO:257), Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO;269), Figure 189 (SEQ ID NO:27l), Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277). Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:2S6). Figure 
5 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 

10 NO:350), Figure 247 (SEQ ID NO:352>, Figure 249 (SEQ ID NO:354), Figure 25 1 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360)> Figure 257 (SEQ ID NO:362). Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368). Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372). Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376). Figure 273 (SEQ ID 
NO:378). Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID N0:400) ? Figure 287 (SEQ ID 
NO:402). Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413). Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID N0:417), Figure 301 (SEQ ID 
NO:419). Figure 303 (SEQ ID NO:421) awl Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide, 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 



CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 
CCTGGTGCACCACCACCaiGTTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 
GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAGAATCA 
ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 
GAACTGGCCAAGAACTCAAAGAGGCAGCATTGGAACCATCGATGGAAAAAATATTTAAAATT 
GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGGTCTTGGAGCATTGTGCTA 
CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 
AT GT CAAGGATAGAATTCATTC CACCTATATG TACTTAGCAGGGAGTATTGGTTTAACAG CT 
TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 
GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCTGGTACGATCAATAC 
CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 
GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 
AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 
ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 
TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 
AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 
CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 
ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 
AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 
ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 
TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 
AATC CT CTCC CAAATAAG CACACACATTTTCAATT CTCATGTTTGAGTGATTTTAAAATGTT 
TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 
TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 
GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 
GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 
GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 
GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCTCCTGCTGTTG 
CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 
ATACAGAGAAT CCTTG ATGG AATTAT ATATGT GTGTTTTACTTTTGAATGTTACAAAAGGAA 
ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATGAAATATGTTGCTTTTTCCAG 
AATACAAACAGTATACTCATG 
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FIGURE 2 



MLAARLVCLRTLPSRVFHPAFTKASPWKNSITKNQWLLTPSREYATKTRIGIRRGRTGQEL 
KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCYYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYMYLAGS IGLTAL.SAI AI SRT PVLMNFMMRGS WVT I G VTFAAMVGAGML VRS I PYDQSP 
GPKHLAWLLHSGVMGAWAPLTILGGPLLIRAAWYTAGIVGGLSTVAMCAPSEKFIJWGAPL 
GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 
QKYDP INSMLS I YMDTLNI FMRVATMLATGGNRKK 
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FIGURE 3 

GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 

CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCCGCCGCGTCGTGAGGG 

GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 

ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 

GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATCAGCCCGGCCTACCTCTTCCTCTGGCCCGA 

AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTCCCTGTGG 

GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 

CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 

GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 

TGT CAGTACTTTATGTCTGGGCCCAGCTGAACAGAGACATGATT GT ATCATTTTGGTTTGGA 

ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 

CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 

GATACC CAATGGACTTGGGAGGAAGAAATTTTCTAT CCACACCTCAGTTTTTGTACCGCTGG 

CTGC CCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGC CC CCTG CT AG CATGAGGCGAGC 

TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 

AGTGAAGGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 

CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 

AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 

TCTCATTCAAGTCCTTACTGCTGTGAAGAACAAATACCAACTGTGCAAATTGCAAAACTGAC 

TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 

AATCTGCTGGCATTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 

TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 

CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 

ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 

AAGACAGCCACGGATGAAGCGTTTCTCAGCTTTTGGAATTGCTTCGACTGACATCCGTTGTT 

AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 

CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 

TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 

TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 

TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 

ATTC CCATTTCAAT CT CATT CTGGATATGTGTTCATTG AGTAAAGGAGGAGAGAC C CTCATA 

CGCTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 

GAGGAAGGCGCAGCTCCTCTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAGCACATCTA 

AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 

TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 

TCGTAGGTGGGCTTTTCCTATCAGAGCTTGGCTCATAACCAAATAAAGTTTTTTGAAGGCCA 

TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 

ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 

TTCAGGAAACATTGTG CT CTAACAGTATGACTATTCTTTC CC C CACTCTTAAACAGTGTGAT 

GTGTGTTATC CTAGGAAATGAGAGTTGG CAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 

TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 

TGTTTTGTTGTTCATCTGTGGCC^CAATAAAGTTTACTTGTAAAATTTTAGAGGCC 

CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAGAATA 

TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 

CAGTC^TTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 

ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 

ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 

TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 

TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 
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FIGURE 4 

MSDIGDWFRSIPAITRYWFAATVAVPLVGKLGLISPAYLFLWPEAFLYRFQIWRPITATFYF 
;?VGPGTGFL YLVNLYFLYQYSTRLETGAFDGRPAD YLFMLLFNW I C I V I TGLAMDMQLLM I P 
LIMSVLYVWAQLNRDMIVSFWFGTRFKACYLPWVILGFNYI IGGSVINELIGNLVGHLYFFL 
MFRYPMDLGGRNFLSTPQFLYRVnjPSRRGGVSGFGVPPASMRRAADQNGGGGRHNWG 
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FIGURES 

GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 

GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCTCTTCCCCTACTCCCTCTCGGCT 

CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 

CCTTTGGGGCGGGATGGCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 

CGGGGTTCCTGCGAGGCCCAGACTGGTCCATC CC CATCTTGGACTTTGTGGAACAGAAATGT 

GAAGTTAACTG C AAAGGAGGGC ATGTGATAACTC CAGG AAGC CCAGAG C CGGTGATTTTGGT 

GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 

ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTT AG AAGGTTAC CT CAAAGAAATTGGAATT 

AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 

CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 

AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 

GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 

GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAAGAAGAAAGGAAGAGGAAAA 

AACAGTTAT CAGAGG CTAAAACAG AAGAGC CC AC AGTG CATTCCAGTG AAGC TGCAAT AATG 

AATAATTCC CAAGGGGATGGTGAACATTTTGCACAC C C AC C C T CAGAAGTT AAAATGCATTT 

TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 

CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 

TTAT CAGT ACTTGGAACAGAAGAACTTCGG CAACGAG AACACTAT CTCAAG CAGAAGAGAGA 

TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGAGCAGAAAG 

GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 

C^UVACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAG^ 

&TAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTC 

CTTACACTG 
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FIGURE 6 

MAAEEEDEVEWWES I AGFLRGPDWSI P ILDFVEQKCEVNCKGGHVITPGSPEPVILVACVP 
LVFDDEEESKLTYTE IHQEYKELVEKLLEGYLKE IG INEDQFQEACTSPLAKTHTSQAILiQP 
VLAAEDFT I FKAMMVQKN I EMQLQAI R 1 1 QERNGVL PDCLTDGS D WSDLEHEEMKI LREVL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 
IEPLGRKVERSETSSLPQKGLKIPGLEHASIEGPIANLSVLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQIQNMEQKGKPTGEVEEMTEKPEMTAEEKQTLLKRRLLAEKLKEEVINK 
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FIGURE 7 

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGA? AGC AAATTGAC CT ATAC AGAGAT 
TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAA 
TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAG 
GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCC 
AGAAAAACATTGAAATGCAG CTGCAAGC CATT CGAATAATTC AAGAGAGAAATGGTGTATTA 
CCTGACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAAT 
CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 
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GCGTGGTTTTTGTTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 
TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 
TCTCAGCTCTAGGATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 
AGTGGAATGGAAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 
ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 
CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 
GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAG 
AAATTTGAAATATGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCT 
TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 
GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 
GAGGCGTCTAAACTGGATCCAGTGGGCTfCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 
TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 
TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 
TACAGCAAAGGAATGGACTTTTCCTGAAGCTAAATGGAACACCACAGCCAGAGTTTTCAGTC 
ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 
AATATCTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 
GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 
GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 
CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 
CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 
TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 
TTTATTTATAATGCCAGCAAGCCTCAAGTTCCGGAATACGCACCTAGGCAAGAAAGGATCCG 
AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 
CCAAACCCAAGAGTGATGAGTCAGATGAAGATACTTTCTAACTGGTACCCACATAGTTTGCA 
GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 
AACCAGAAATGTTTCTAAATCCTAATATTCTTTGCATATATCTAGCTACTCCCTAAATGGTT 

C CAT CCAAGGCTTAGAGT AC C CAAAGGCTAAG AAAT T C T AAAG AACTGATACAGGAGTAACA 
ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 
TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 
ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 
CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 
CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGGACCTAAATAC 
CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 
TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 
AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 
AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 
GTCATTACTCCTGAATTATTACATTTTGGAGAATAAGAGGGCATTTTATTTTATTAGTTACT 
AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 
CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 
AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 
AGTAATATATAAGTCACTTTACAGTGCTACTTCACACTTAAAAGTGCATGGTATTTTTCATG 
GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 
AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 
CTTTAACTGGCAGGCCTGTATGTTTACAGACTACCATACTGTAAATATGAGCTTTATGGTGT 
CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 
AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 
GTGGAAATTTGTAATTAAAATAATTATT AAAC CT 
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MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 
LVFCVLVSFCVIKKDHQSRNLKYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQPAMAV 
IFSNFSIITTALLFRIVLKRRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGFHHDAFF 
SPSNS CLLFR SE CP RKDNCTAKEWTF PE AKWNTTAR VF SH I RLGMGHVL 1 1 VQCF I S SMAN I 
YNEKILKEGNQLTES I FIQNSKLYFFGI LFNGLTLGLQRSNRDQI KNCGFFYGHSAFSVAL I 
FVTAFQGLSVAFILKFLDNMFHVLMAQWTVIITTVSVLVFDFRPSLEFFLEAPSVLLSIFI 
YNASKPQVPEYAPRQERIRDLSGNIiWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 
TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCCAC 
GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 
GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 
AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 
TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 
AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 
TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAGAAATTTGAAATA 
TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 
TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 
TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 
CTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGGA 
CTAAAACTTTA 
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FICIJME 11 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCGGCTCCGCGGCGGGGCTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTATG 
CACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCAGATCTAC 
CTATTGG&IgTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGCCGAGC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATCCCCCTGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 
ATCTGGTTCGTGCCCGACTTCCCACACGGCCAGACCTATTGGTACCTGCTTTTCTATTGCCT 
CTTTGAAACAATGGTCACGTGTTTCCAtGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAA.GTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCTGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTC2&SGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCCGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 
TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTTTTACAGAGCC 
TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 
TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC 
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FKCUME 12 

MWLRWALSLPPSSCLWAEPGMPSQTPWWASA'-JANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSKTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMT^VLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
ILILGVREQREPYEAQQSEPIAYFRGLRLVMSHGPYIKLITGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNEFQNLLLAIMLSATLTIPIWQWFLTRFGKKTAVYVGISSAVPFLILVALMESNLI 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
ISTLSLDFAGYQTRGCSQPERVKFTLNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKAL 
QALRDEASSSGCSETDSTELAS I L 
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FIGURE 13 

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 
GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTGA 
GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTTA 
TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTTT 
TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 
GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 
TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 
CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 
ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 
GAACCGAGCCCAT 
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FTICUME 14 

GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGT 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCT 
TTCATATTTTCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATAT 
CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGG 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAA 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 
TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCTTTCCTACCAAATG 
CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 
AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCACAGTGGCAATTTTGGGACTG 
ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 
ACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTTGGTTTTTTCCTGACTTACATTCGTGA 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 
CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 
TT CTT CT CTGAAATTTTCAA C CACTT AAT C AAGG CT GAC AGT AAC AC TG ATG AATG CTGAT A 
AT CAGGAAACATGAAAGAAGC C AT TTGATAGATT AT TCTAAAGG ATAT CAT CAAGAAGACTA 
TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACTATG 
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FIGURE IS 

^^WWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPyISDTGTVAPEKCLFGAMLNI 
AAVLCIATIYVRYKQVHALSPEENVIIK^ 

AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWNPEDKGYVLHM I TTAAEWSMS FSFFGFFLTYI RDFQK I S LRVEANLHGLTL YD 
TAPCP I NNERTRLLSRD I 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCTCTCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGAAGCAGT 
TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 
CGTT C C TGTTGATTTACAAAAGGTG CAGGTATGAG C AGGT CTGAAGACTAACATTTTGTGAA 
GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 
TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACACT 
CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTANC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGC7GCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGAC&SgCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCTTGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 
AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 
GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCTGCGGCGCCTA 
CCTCTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTGCCT 
TGTTTGGGCCATGGCTGGGGCTTCTGCTGTGCTGTGTGTTGACCTCGGTGGGTGCCACATGC 
TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 
GGCC CTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAG CTTGTTTTTTTT CTTATTGTTTT 
TGAGACTTTTCCCCATGACACCAAACTGGTTCTTGAACCTCTCGGCCCCAATTCTGAACATT 
CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCGGTTTGATCCCATATAATTTCATCTGTGT 
GCAGACAGGGTCCATCCTGTCAACCCTAACCTCTCTGGATGCTCTTTTCTCCTGGGACACTG 
TCTTTAAGCTGTTGGCCATTGCCATGGTGGCATTAATTCCTGGAACCCTCATTAAAAAATTT 
AGTCAGAAACATCTGCAATTGAATGAAACAAGTACTGCTAATCATATACACAGTAGAAAAGA 
CACAI^TCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 
TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATTGCCTTCTATAGGTGATGTGGACACTGTG 
CATCAATGTGCAGTGTCTTTTCAGAAAGGACACTCTGCTCTTGAAGGTGTATTACATCAGGT 
TTTCAAACCAGCCCTGGTGTAGCAGACACTGCAACAGATGCCTCCTAGAAAATGCTGTTTGT 
GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 
ACAAGGTCAGGAGTTCAAGACCAGCCTGGCCAAGATGGTGAAATCCTGTCTCTAATAAAAAT 
ACAAAAATTAGCCAGGCGTGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 
AGGAGAATTGCTTGAACCAAGGTGGCAGAGGTTGCAGTAAGCCAAGATCACACCACTGCACT 
CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGLLLVFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDLAELRELSEVLREYR 
KEHQAWFLLFCGAYLYKQGFAIPGSSFUn^AGALFGPWLGLLLCCVLTSVGATCCYLLSS 
IFGKQLWSYFPDKVALLQRKVEENRNSLFFFLLFLRLFPMTPNWFLNLSAPILNIPIVQFF 
FSVLIGLIPYNFICVQTGSILSTLTSLDALFSWDTVFKLLAIAMVALIPGTLIKKFSQKHLQ 
LNETSTANHIHSRKDT 
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FICUIRE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGCCCCGCATGAATCATTGTAGTCAATCATTTT 
CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAA 
TAGGAAAATAACTTGGGATTTTATATTGGAAGAC^TOGATCTTGCTGCCAACGAGATCAGCA 
TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGAGACAGACCGGCCATCAGTGTGGCATG 
TCAGAGAAGGCAATTGAAAAATTTATCAGACAGCTGCTGGAAAAGAATGAACCTCAGAGACC 
CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCTCGCAACCTTGGGATTAATCT 
TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTCT 
GGAG CT CACACCTGGCGCTCACTCAT C CAT CACATTAGGCTGATGT CCTTGCC CATTGCCAA 
GAAGTACATGT CAGAAAATAAGGG AGTT CCTCTGCATGGGGGTGATGAAGACAGAC CCTTT C 
CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 
AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 
ATTTGAGAGG CT CCATCCACTGGTGAT CAAGACGGGAAAGCCCCTGTTGGAGGAAGAG ATTC 
AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCCAAG 
TGGTGGCGCTGCTTTCCTGAGCGGTGGTTC CCATTT CCTTATCCATGGAGGAGACCTCTGAA 
CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTT CACT CACCTGCCATTTCCAAAAGATG 
CCTCTTTAAACAAGTGCT C CTTTC TT CACC CAGAAC CTGT TGTGGGGAGTAAGATGCATAAG 
ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 
GTGCCGAAGACATTGTCAGTCTGTGGCCATGCCAATAGAGCCAGGGGATATCGGCTATGTCG 
ACACCACCCACTGGAAGGTCTACGTTATAGCCAGAGGGGTCCAGCCTTTGGTCATCTGCGAT 
GGAACCGCTTTCTCAGAACTGTAGGAAATAGAACTGTGCACAGGAACAGCTTCCAGAGCCGA 
AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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MDLAANEISIYDKLSETVDLVRQTGHQCGMSSKAIEKFIRQLLEKNEPQRPPPQYPLLIVVY 
KVI^TLGLILLTAYFVIQPFSPLAPEPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVPL 
HGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLVIKT 
GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPYPWRRPLNRSQMLRELFPV 
FTHLPF PKDASLNKCS FLHPEP WGS KMHKMPDL F I IGSGE AMLQL I PPFQCRRHCQS VAMP 
I E PGD I GYVDTTHWKVYV I ARGVQP L V I CDGTAFSEL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCT CAGAATAGGAAAATAACTTGGGAT TTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTTG 
ATTTGGTGAGACAGACCGGCCATCAGTGTGGCATGTCAGAGAAGGCAATTGAAAAATTTATC 
AGACAG CTGCTGGAAAAGAATGAACCTCAGAGAC CC C C C C CG CAGTATCCTCTC CTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FIGURE 22 



CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 
CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 
CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 
CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 
CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACC&ISTCCCAGTTCGAAATG 
GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 
TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 
TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 
AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 
CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 
GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 
CTGGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 
CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 
AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 
GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 
GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 
GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 
GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 
CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 
CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 
TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 
GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 
TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 
CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 
GGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 
ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 
AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 
GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 
AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 
GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 
AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 
AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 
TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 
GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 
TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 
GCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 
CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 
CCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 
ACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 
GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 
ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 
CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 
GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 
CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 
GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 
CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 
TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 
CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 
GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 
GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 
GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 
TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 
GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 
CCTACTGAGGCGCGCIS&ACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 
GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 
GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 
GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 
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FIGURE 23 

MSQFEMDTYAKSHDLMSGFWNACYDMLMSSGQRRQWERAQSRRAFQELVLEPAQRRARLEGL 
RYTAVLKQQATQHSMALLHWGALWRQI^ 

HHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELAELETP 
MEAAEUDEQREKLVLSAECQLVTVVAWPGLLEVTTQNVYFYDGSTERVETEEGIGYDFRRP 
LAQLRE VHLRRFNLRRSALELFFI DQANYFLNFPCKVGTTPVSS PSQTPRPQPGP I P PHTQV 
RNQVYSWLLRIjRPPSQGYLSSRSPQEMLRASGLTQKWVQ 

SQYPVFPWVLQDYVSPTLDLSNPAVFRDLSKPIGWNPKHAQLVREKYESFEDPAGTIDKFH 
YGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARliESPADVKELIP 
EFFYFPDFLENQNGFDLGCLQLTNEKVGDVVLPPWASSPEDFIQQHRQALESEYVSAHLHEW 
IDLIFGYKQRGPAAEEALNVFYYCTYEGAVDLDHVTDERERKALEGIISNFGQTPCQLLKEP 
HPTRLS AEEAAHRLARLDTNSPS I FQHLDELKAF FAE VTVSASGLLGTHS WLP YDRNI SNYF 
SFSKDPTMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 
SQLSCM*DVVTCLALDTCGIYLISGSRDTT 

CVAISTELDMAVSGSEDGWIIHTVRRGQFVAALRPLGATFPGPIFHLALGSEGQIVVQSSA 
V^RPGAQVTYSLHLYSVNGKLRASLPl^QPTALTVTEDFVLLGTAQCAIiHILQLNTLLPAA 
PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVWAGQPSEVRSSQFARKLWRSSRRISQVSS 
GETEYNPTEAR 
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FIGURE 24 

SSSACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 
CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 
CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTC 
TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 
CTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 
GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 
ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 
CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 
TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 
AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGA 
CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 
TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 
TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 
TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 
GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 
GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGSS&CAGCTCCGGCCCTGATCCAGGACTGC 
ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 
AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 
AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 
GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 
TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 
TCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAA 
AAAGATTTTATTAAAGATATTTTGTTAACTC 
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FIGURE 25 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLF 
WTLNWVLALGQC VLAGAFAS F Y WAFHKP QD I PTFPL I SAF I RTLR YHTGS LAFGAL I LTLVQ 
IARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 
NAFMLLMRNIVRVVVLDKVTDLLLFFGKLLW 

WLP I MTS I LGAYVI ASGFFS VFGMCVDTLFLCFLEDLERNNGSLDRP YYMSKSLLKI LGKKN 
EAPPDNKKRKK 
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FTTCURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCG".'CCGGCCGCCGT 
GGCTAMTTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 
TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 
CAGTGTGACCACGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGC 
ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 
ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 
CCAGTCAATGTCGTCAATGTATACAACGATACCCAGATCAAATTACTCATTAAACAAGATGA 
TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 

CAGGAAAT GACAGTGATGGGTCAGAG C CTT CTGAG AAG C G C ACACGGTTAGAAG AGG AGAT A 

GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 

CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 

CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 

CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 

CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 

CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 

AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 

GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 

AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 

TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 

TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 

CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTAACCTGGACAAGCTG 

TACCATGGCCTGGAACTCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCTGC 

CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 

TCCAGATGTCATGCTGTTCTCTAGGCCGGCATCCCTAAGCCTGCTCAGCAAACACCTGCTCA 

AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 

CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 

GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 

TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 

CTGGACGCACTTATTTCCCTCCTGTCCIAgGAATTTGATTCTTCCAGAATGACCTTCTTATT 

TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 

TTTT ATT AAAT AAAATG CTTATTTT AGG AAA 
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FIGURE 27 

MFVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 

LEHKEQFHYF I L INCGANVDLLD I LQ PDEDT I FFVCDSHR PVNWNVYNDTQ I KLL I KQDDD 
LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 
DYEQYEYHGTSSAMVMFEI^WMLSKDLNDMLVWAIVGLTDQWVQDKITQMKYVTDVGVLQRH 
VSRHNHRNEDEENTLSVDCTRISFEYDLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHGQKR 
LQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFL 
ASDWFATMSLMESPEKI)GSGTDHFIQAIJDSLSRSNLDKLYHGLELAKKQLRATQQTIASCL 
CTNLVISQGPFIiYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 
LSMEHGTVTWGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 

DALISLLS 
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G'1'ACCtCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 
GGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 



i 
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FIGURE 29 

CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCCAGTACCATTTTTTCTAGTGAAC 
CACGAAGGGACGATAC CAGAAAACAC C CTCAACC CAAAGGAAATAGACTAC AGC CCCAATTG 
GCTGACTTTGGCTATAGAM.AAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCTAA 
GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCTTCTTGAGCTATTTACAGCTTTTAACAATT 
GAGTAAAGTACGCTCCGGTCACCATGGTGACAGCCGCCCTGGGTCCCGTCTGGGCAGCGCTC 
CTGCTCTTTCTCCTGATGTGTGAGATCCGTATGGTGGAGCTCACCTTTGACAGAGCTGTGGC 
CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATCCTGCCCATGTATCCTCAG 
CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 
CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCTGCCAGGGTACATGGGCAGGGAGGG 
TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGCAAGGGTGACAAGGGGGAGATGGGCAGCCCCG 
GCGCCCCGTGCCAGAAGCGCTTCTTCGCCTTCTCAGTGGGCCGCAAGACGGCCCTGCACAGC 
GGCGAGGACTTCCAGACGCTGCTCTTCGAAAGGGTCTTTGTGAACCTTGATGGGTGCTTTGA 
CATGGCGACCGGCCAGTTTGCTGCTCCCCTGCGTGGCATCTACTTCTTCAGCCTCAATGTGC 
ACAGCTGGAATT AC AAGGAGACGTACGTGCACATTATGCATAAC CAGAAAGAGGCT GT CAT C 
CTGTACGCGCAGCCCAG CGAGCG CAGCAT CATG CAGAG CCAGAGT GTGATGCTGGACCTGGC 
CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGCA 

ACGACTTCGACACCTACATC AC CTTCAG CGGCCACCTCATCAAGG C CG AGGACGACTg&GGG 
CCTCTGGGCCACCCTCCCGGCTGGAGAGCTCAGGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 
TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTGGCATTCTGGGGAGA 
CCCTGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTCCTCTCTTCTCTCT 
TGGACCTATTTTAAGAAGCTTGCTAAC CTAAATATTCTAGAACTTT C C C AG CCT C GT AG C CC 
AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCCGGGGTTCGTGTTAAATGCAGATTCT 
GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTGATGCTG 
ATGGGGT CAGT CTATGAAC CACACTGG AGCAAC CAGGTT CT AGGACTTT CTCAATATTCTAG 
TACTTTCTGAACATTCTGGAATCCTCCCCACATTCTAGAATTCTCCCAACATTTTTTTTTCT 
TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTCACTGC 
AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCCTAGTGGCTGGGATTAC 
AGGCGCCTGCTACCATGCCTGGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATA 
TTGGCCAGGCTGGTCTTGAACTCCTGACTTCAGGTGACCCACCCGCCTCGGCCTCTCAAAAT 
GCTGGGATTACAGGTGTGAGCCACCGTGCCTGGCCAATTCCAACATTCTTAAATTCTCTCAT 
CCCTCCAGGGCTCCCCGTGCTATGTTCTCTTTACCCCTTCCCCCTCTTCTCTTGCTCAGGCC 
TGCACCACTGCAGCCACCGTTCATTTATTCATTCATTAAACACTGAGCACTCACTCTGTGCT 
GGGTCCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCTCAGTGACTGGCCA 
GTCCAGCCCAGGCGGGGAGAGATGTGTACATAGGTTTTAAAGCAGACCCAGAGCTCATGGGG 
GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATTACCCACTGCTCCCCAAGGCTGG 
TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACCTGCCCAG 
TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCTGTGAGGAGAGG 
ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCCCCGAAGAATGGAACCCACACCCA 
TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTCCTCACTGGGATCCC 
CTTCCTGCCTCCTCCCAGGGCTCTGCCAGGGCCTTGCTCAGTCCCTTCCACCAAAGTCATCT 
GAACTTCCGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGTCCCCAGGTGCT 
CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTCTCCAGGCTTTATCAAGGTG 
CTAAGGCCCGGGTGGGCAGCTCCTCGTCTCAGAGCCCTCCTCCGGCCTGGTGCTGCCTTTAC 
AAACACCTGCAGGAGAAGGGCCACGGAAGCCCCAGGCTTTAGAGCCCTCAGCAGGTCTGGGG 
AGCTAGAGCAAAGGAGGGACCTCAGGCCTTCCGTTTCTTCTTCCAGGGTGGGGTGGCCTGGT 
GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTGCCCTTCTCCCCAGAGGGAGGCGGCCTCCGC 
CCATTGGTGCTCATGCAGACTCTGGGGCTGAGGTGC CC CGGGGGGTGATCTCTGGTGCTCAC 
AGC CGAGGGAG CCGTGG CT CC ATGGCCAGATGACGGAAACAGGGTCTGACCAAGTGCCAGGA 
AGACCTGTGCTATAAACCACCCTGCCTGATCCTGCCCCTGCCTGACCCCGCCACGCCCTGCC 
GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAAAAAAAA 
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MVTAALGPVWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 
FAFSVGRKTAIiHSGEDFQTLLFERVFVNLDGCFDMATGQF AAPLRG I YFFSLNVHSWNYKET 
YVH IMHNQKEAV I LYAQ P SERS IMQSQSVMLDLA YGDRVWVRLFKRQRENAI YSNDFDTY I T 
FSGHLIKAEDD 

Important features: 
Signal peptide t 

amino acids 1-20 

N-glycosylation site. 

amino acids 72-75 

Clg domain proteins. 

amino acids 144-178, 78-111 and 84-117 
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ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 
AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAG&SSTG 
CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 
GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGG 
ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 
CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 
AGAACCAGATCGCCAGCCTGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 
GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 
CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 
TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 
CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 
CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 
AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 
CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 
CCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 
TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 
CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 
GGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGCCCA 
AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 
ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCTTCTAG 
CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 
CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTC 
AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 
CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 
CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGC 
GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 
TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 
AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 
CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 
CTCCAACCACGCCCCAGTCACCCAGGCCCGGGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 
CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 
GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGGCCCT 
GGAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGG CCCGAAGG CAACAG AGGG CGGTGGAG 
AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 
CAGTCACCCCTCCACGCAAAGCCCTACATCS&AGCCAGAGAGAGACAGGGCAGCTGGGGCCG 
GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 
CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 
CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 
CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 
GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 
CCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGCGGCTGTG 
TGACTCTAGTCTTGGC CC CAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 
GGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 
GGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 
AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
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MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 

GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLR 

RLERLYLGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEP 

GILDTANVEALRIiAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAG 

NTRIAQLRPEDLAGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCV 

PVfVRESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALS 

SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHIiGTRHHLACLCPE 

GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 

TYRNLSGPDKRLVTLRLPASLAEYTVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPA 

VHSNHAPVTQAREGNLPLLIAPAIAAVLLAAl^VGAAYCVRRGRAMAAAAQDKGQVGPGAG 

PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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FIGURE 33 

GAATCATCCACGCACCTGCAGCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCAT 
CTTCATCATTCATATGAGGAAATAAGTGGTAAAATCCTTGGAAATACA&TGAGACTCATCAG 
AAAC AT TT AC ATATTTTGTAGTATTGTTATGACAGCAGAGGGTGATG CT CCAGAG CTG C CAG 
AAGAAAGGGAACTGATGACCAACTGCTCCAACATGTCTCTAAGAAAGGTTCCCGCAGACTTG 
ACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCAGA 
TTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGG 
AT CT CAAAACCTTTGAATTC AACAAGGAGTTAAGATATTTAGATTTGTCTAATAA CAGACTG 
AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAGGTATTTAGATCTTTCTTTTAATGACTT 
TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCTAGGTTTGA 
GTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATT GCTC AT CTGCAT CTAAATACTGTC 
TTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAAC 
AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCA 
AGACTTCAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTGTAAGTTATGAA 
ATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGTTCTATTGCTTAATAAAGTTGA 
TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGGAACACT 
TT CAGAT C CGAAATGTGACTTTTGGTGGTAAGGCTTAT CTTGACC ACAATT CATTTGACTAC 
TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCA 
ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATAGAAAAC CTGACAAT AT CAAATG 
CACAAATGCCACACATGCTTTTCCCGAATTATCCTACGAAATTCCAATATTTAAATTTTGCC 
AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTGAAAACTCT 
CATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCTTTGCTAACAACACAC 
CCTTGGAACACTTGGATCTGAGT CAAAATCTATTACAACATAAAAATGATGAAAATTGCT CA 
TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAG 
GTGCTTGCC C AAAAGTATTCAAATACTTGAC CT AAAT AATAAC CAAAT C CAAACTGT ACCT A 
AAGAGACTATTCATCTGATGGCCTTACGAGAACT AAAT ATTG CATTTAATTTTC TAACTGAT 
CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAAATGAACITCATTCT 
CAGCCCATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAA 
ATCCATTCCGGTGTACCTGTGAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTAG 
GTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCACCATTG 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCC 
TGGTATCTCAGGATGCTAGGTCAATGCACAC AAACATGG C ACAGGGT TAGGAAAACAACCCA 
AGAACAACTCAAGAGAAATGTCCGATTCCACGCATTTATTTCATACAGTGAACATGATTCTC 
TGTGGGTGAAGAATGAATTGATC CCCAATCT AG AG AAGGAAGATGGTT CTAT CT TGATTTG C 
CTTTATGAAAG CTACTTTG ACC CTGGCAAAAG CATTAGTGAAAAT ATTGT AAGCTT CAT TG A 
GAAAAG CTATAAGTCCATCTTTGTTTTGT CT C CC AACTTTGTC CAGAATGAGTGG TG C CAT T 
ATGAATTCTACTTTGC CCACCACAAT CT CTTC CATGAAAATTCTGATCATATAATTCTTAT C 
TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCT 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGTAAATGTGGGCTTTTCTGGGC?LA 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 
TTCACAGAGTTAAATGAAGAGT CT CGAGGTTCTACAATCT CTCTGATGAGAACAGATTGTCT 
ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATA 
CAACCTTTATGATGGCT^TTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATA 
TC AGTTTCTAGAAGGATTTCTAAGAATGTATC CT ATAGAAACAC CTTCACAAGTTTATAAGG 
GCTTATGGAAAAAGGTGTTCATC CCAGGATTGT TT AT AATC ATGAAAAATGTGGCCAGGTG C 
AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGTCAA 
GAGATGGAGACCATCCTGGCCAACATGGTGAAACC C TGT CTCTACTAAAAATACAAAAATTA 
GCTGGGCGTGATGGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCG 
CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGT 
GACAGAGCGAGACTCCATCTCAAAAAAAAGAAAAAAAAAAAAGAAAAAAATGGAAAACATCC 
TCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAATATTA 
CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATAT 
GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTGAAATGGCATTGAA 
ATAAAAGTGTAAAGAAAT CTAT AC CAGATGTAGT AACAGTGGTTTGGGTCTGGGAGGTTGGA 
TTACAGGGAGCATTTGATTTCTATGTTGTGTATTTCTATAATGTTTGAATTGTTTAGAATGA 
ATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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FIGURE 34 

MRLIRNIYIFCSTVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 

liQSSDFHSVSKLRVLILCHNRIQQLDLKTFEFNKELRYLDLSiraRLKSVTWYLLAGLRYLDL 

SFNDFDTMP I CEEAGNMSHLEI LGLSGAKI QKSDFQKI AHLHLNTVFLGFRTLPHYEEGSLP 

I LNTTKLHI VLPMDTNFWVLLRDG I KTS K I LEMTNI DGKSQFVS YEMQRNLSLENAKTS VLL 

LNKVDLLWDDLFLI LQFVWHTS VEHFQ I RNVTFGGKAYLDHNS FDYSNTVMRT I KLEHVHFR 

VFYIQQDK I YLLLTKMD I ENLT I SNAQMPHMLFPNYPTKFQYLNFANNI LTDELFKRT IQLP 

HLKTLILNGNKLETLSLVSCFANNTPLEHLDLSQNLLQHKNDENCSWPETVVNMNLSYNKLS 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 

1VINFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYTCEYPLN 

LRGTRLKDVHLHELSCNTALLIVTIWIMLVLGLAVAFCCLHFDLPWYLRM 

RKTTQEQLKRNVRFHAF I SYSEHDSLWVKNEL I PNLEKEDGS IL I CLYES YFDPGKS I SEN I 

VSFIEKS YKS I FVLSPNFVQNEWCHYEFYFAHHNLFHENSDHI I LI LLEP I P FY CI PTRYHK 

LKALLEKKAYLEWPKDRRKCGLFWANLRAAI^^WVLATREMYELQTFTE 

RTDCL 
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GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 
CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 
CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 
GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 
CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 
CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 
GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 
GGCAGGAGAT CTGGAG CCGGGAGC CCTACTACGC G CGC C CGG AGCC CGAGCTCGAGACCTT C 
TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 
GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 
CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 
AACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 
TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 
TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 
GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 
GACC AGATTCACTGGTGTCATCACTCAAGGGAGGAACT C C CT CTGG CTGAGTGACTGGGTGA 
CATC CTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTC AC TGTT AAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAG AT CC CTGT TC TCAATGAG CTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGGCTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGG 
AACGAGATGACCACCACTGATGAC CTGGAT TTT AAG CAC C AC AATTATAAGGAAATGCGCCA 
GTTGATGAAAGTTGTGAATGAAATGTGTCCCAATATCACCAGAATTTACAACATTGGAAAAA 
GC CACCAGGG CCTGAAGCTGTATGCTGTGGAGAT CTCAGATCACC CTGGGGAGCATGAAGT C 
GGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCT 
GCTGCTGCTGCTGGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCC 
AC CTGGTGGAGGAGACGCGGATTCACGTC CT C C C CT C C CT CAACCC CGATGG CTACGAGAAG 
GC CTACGAAGGGGG CT CGGAGCTGGGAGGCTGGT CC CTGGGACGCTGGACCCACGATGGAAT 
TGACATCAACAACAACTTTCCTGATTTAAACACGCTGCTCTGGGAGGCAGAGGATCGACAGA 
ATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAAT 
GC CACGGTGGCTGCCGAGACCAGAGCAGTCATAGC CTGGATGGAAAAAAT C CCTTT TGTG CT 
GGGCGGCAACCTGCAGGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCC 
CCTGGAAGACGCAGGAACACACCCCCACCCCCGATGACCACGTGTTCCGCTGGCTGGCCTAC 
TC CTATGCCTCCACACACCGCCTCATGACAGACGCCCGGAGGAGGGTGTG C CAC ACGGAGGA 
CTTCCAGAAGGAGGAGGGC^CTGTCAATGGGGCCTCCTGGCACACCGTCGCTGGAAGTCTGA 
ACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAA 
TACCCACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTT 
CATGGAGCAGGTTCATCGTGGCATTAAAGGCTTGGTGAGAGATT CACATGGAAAAGGAAT CC 
CAAACGC CATTATCTCCGTAGAAG GCATTAAC CATGACATCCGAAC AG CCAACGATGGGGAT 
TACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAAGGCCGAAGGTTTCACTGC 
ATCCACCAAGAACTGTATGGTTGGCTATGAC^TGGGGGCCACAAGGTGTGACTTCACACTTA 
GCAAAACCAACATGGC CAGGAT CCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAG C 
CTGCCAGCCAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGSGACCCTCCTG 
GGCCCTTGAGACTCGTCTGGGACCCATGCAAATTAAACCAACCTGGTAGTAGCTCCATAGTG 
GACTCACTCACTGTTGTTTCCTCTGTAATTCAAGAAGTGCCTGGAAGAGAGGGTGCATTGTG 
AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTCCCATTTA 
TCCAAATAACTTGG AC AGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAG CAAG CC 
AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGC CTGT CCGTTCAGAGC CTCTGGCTGC 
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ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGCATTTGC 
AATTTGCACAGCTAAAATTGCAGCATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGA 
GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 
CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 
TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 
GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCCATA 
AACAACCCCCAAAGTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCA 
GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 
GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 
GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 
GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 
CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 
AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 
CTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 36 

MSRPGTATPALALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPE9ELETFSPPLP 
AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 
VRVAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCA 
GRNDLQQWIEVDARRLTRFTGVITQGRNSLWLSDWTS 

EGNSEKE I PVLNELPVPMVARYIR INPQSWFDNGS I CMRME I LGCPLPDPNNYYHRRNEMTT 
TDDLDFKHHNYKEMRQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEF 
HYIAGAHGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGG 
S E LGGWS LGRWTHDG I D I NNNF PD LNTLLWEAEDRQNVPRKVPNHY I A I P EW FL S ENATVAA 
ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 
HRLMTDARRRVOTTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 
QLPEEWENNRESLI VFMEQVHRGI KGLVRDSHGKG I PNAI I S VEGINHDI RTANDGDYWRLL 
NPGEYWTAKAEGFTASTKNCMVGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARR 
LKLRGRKRRQRG 
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OTA/vGAGGACAAGATgAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 
AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 
CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 
CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 
ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 
GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAGAACTTTCTAAAGT 
GAGGG AATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 
TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 
GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 
AATTGTTGAC CAGCTGGAGGTGGAGATAAGAAATATGACT CTCTTGGTAGAGAAGCTTGAGA 
CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 
AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 
GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 
GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 
GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 
CAACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 
AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 
ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTCCCTAATGCTGC 
CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 
AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCACTGGTAACATGGTGATTAGTAAA 
CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 
TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 
AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 
CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 
CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAGTAAGCTG 
TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 
ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 
CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 
CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 
GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 
AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 
TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGGAAGAAACCTGGGGACTTAGTTAG 
GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 
CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 
TCTGTAAAGTG CTGAGTTTTATGGAGAGAGG C CT TTTT ATGC ATTAAATTGTACATGG CAAA 
TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 
ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 
AACCAGACTTACTAACC^TTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 
ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 
TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 
GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 
TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 
TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 
GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 
TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGTTTAAATGCTTTAC 
T CCCCCTTTTAAAATAAATGATTAAAATGTGCTTTC 
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FIGURE 38 

MRPGLSFLLALLFFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSFSSSSRSGSSSSRSLGS 
GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 
QLISVYEKKLLNLTVRIDIMEKDTISYTELDFELIKVEVKEMEKLVIQLKESFGGSSEIVDQ 
LEVEIRNMTLLVEKLETLDKNNVLAIRREIVALKTKLKECEASKDQNTPVVHPPPTPGSCGH 
GGVVNISKPSWQIiNWRGFSYLYGAWGRDYSPQHPNKGLYWVAPLNTDGRLLEYYRLYNTLD 
DLLL Y I NARELR I T YGQGSGTAVYNNNM YVNM YNTGN I ARVNLTTNT I AVTQTL PNAAYNNR 
F S YANVAWQD I D F AVDENGL WV I Y STEASTGNMV I S KLNDTTLQ VLNTWYTKQYKPSASNAF 
MVCGVLYATRTMNTRTEE IFYYYDTNTGKEGKLDIVMHKMQEKVQS INYNPFDQKLYVYNDG 
YLLNYDLSVLQKPQ 
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FIGURE 39 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCC'rCi'rLUl'CCAC 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTCAGCTCAACTGGAGAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACA 
TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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FIGURE 4(D> 



TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 
CCGCTGCTCTTGTGACGTTGTGGAGASSGGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
GGATACCATGTTTGTGTGGAAGTGCCCCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAAC 
AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTG 
TGTAATGTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATG 
AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTT 
GGTTTGGCTATGTT CT ATCTTCTT CT CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTG A 
TCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTA 
TTATTGGGGCATTCTTCATTCCAGAAGGAACTTTTACAACTGTGTGGTTTTATGTAGGCATG 
GCAGGTGCCTTTTGTTTCATCCTCATACAACTAGTCTTACTTATTGATTTTGCACATTCATG 
GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGTTGGTATG CAG CCTTGT 
TAT CAG CTACAGCTCTGAATTAT CTGCTGTCTTTAGTTGCT AT CGTC CTGTT CTTTGTCTAC 
TACACTCATCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTCAACATGCTCCTCTG 
CGTTGGTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAGAATCACAACCAAGATCTGGTT 
TGTTACAGTCTTCAGTAATTACAGTCTACACAATGTATTTGACATGGTCAGCTATGACCAAT 
GAAC CAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTGGCTACAATAC AACAAG CAC 
TGTCCCAAAGGAAGGGCAGTCAGTCCAGTGGTGG CATG CT CAAGGAAT TATAGGACTAATT C 
TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCAAACAATAGTCAGGTTAATAAA 
CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGTGATGGATC 
ACTGGAGGATGGGGACGATGTT CACCGAGCTGTAGATAATGAAAGGGATGGTGT CACTT ACA 
GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCACTTTATATCATGATGACCCTTACC 
AACTGGTCCAGGTATGAACCCTCTCGTGAGATGAAAAGTCAGTGGACAGCTGTCTGGGTGAA 
AATCTCTTCCAGTTGGATTGGCATCGTGCTGTATGTTTGGACACTCGTGGCACCACTTGTTC 
TTACAAATCGTGATTTTGACIS&GTGAGACTTCTAGCATGAAAGTCCCACTTTGATTATTGC 
TTATTTGAAAACAGTATTCCCAACTTTTGTAAAGTTGTGTATGTTTTTGCTTCCCATGTAAC 
TTCTCCAGTGTTCTGGCATGAATTAGATTTTACTGCTTGTCATTTTGTTATTTTCTTACCAA 
GTGCATTGATATGTGAAGTAGAATGAATTGCAGAGGAAAGTTTTATGAATATGGTGATGAGT 
TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGAGTAAAA 
ACAAATTTGTTTGACTATTTTAAAATTATATTAGACCTTAAGCTGTTTTAGCAAGCATTAAA 
GCAAATGTATGGCTGCCTTTTGAAATATTTGATGTGTTGCCTGGCAGGATACTGCAAAGAAC 
ATGGTTTATTTTAAAATTTATAAACAAGTCACTTAAATGCCAGTTGTCTGAAAAATCTTATA 
AGGTTTTACCCTTGATACX3GAATTTACACAGGTAGGGAGTGTTTAGT 

TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGTAAATAATCTTACTTGGGTAGAGATGG 
CCTTTGCCAACAAAGTGAACTGTTTTGGTTGTTTTAAACTCATGAAGTATGGGTTCAGTGGA 
AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTGAAAAGGATAATCATGGGTTAGAAGG 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGGCCCAGCACGGTAGCTCACCCTTGGT 
AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGC C CAGGAATTCAGACCAGCT 
TGGCACATGGTGAACCTGTTCTATAAAAATAATCTGGCTTTGAGCATATGCCTGTGGTCCAG 
CACTGAGAGGCTAGTGAAGATTGCTGAGCCCAGAGCCAAAGGTTGCAGTGAGCAAGTCACGT 
CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATATATATATTGAAATCAAGGAGG 
CAAAATTTTGACAGGGAAGGAAGTAACTGCAAAACCACTAGGCTTTAGTAGGTACTTATATA 
AAAT CTAGTCCAGTTCTCTCATTTAAAAAAATGAAGACACTGAAATACAGACTTAAAT AG CT 
CAGATAGCTAATTAGGAAATTTCAAGTTGGCCAATAATAGCATTCT CT CTGACATT TAAAAA 
TAATTTCTATTCAAAATACATGCATATTGATTTACACCTCATACTGTGATAATTAATGTGAT 
GTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAGAATGATGGAATGTTTTAGA 
ATAAACTC CTGCTT ATAGTATACTACAC AGTT CAAAAGATGTTTAAAATGCTTTTGT ATTTA 
CTGCCATGTAATTGAAATATATAGATTATTGTAACCTTTCAACCTGAAAATCAAGCAGTATG 
AGAGTTTAGTT ATTTGTATGTGTCACTAGTGTCTAATGAAG CTTTTAAAATCTACAATTT CT 
TCTTTAAAAATATTTATTAATGTGAATGGAATATAACAATTCAGCTTAATTCCCCAACCTTA 
TTCTGTGTGTAGACATTG TATT C CACAATTTTGAATGG CTGTGTTTTACCTCTAAAT AAATG 
AATT CAG AG AAAAAAAAAAAAAAA 
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FIGURE 41 

MGSVLGLCS^4ASWIPCLCGSAPCLLCRCCPSGNNSTVTRLIYALFLLVGVCVACVMLIPGME 
EQLNKIPGFCENEKGWPCNILVGYKAVYRLCFGLAMFYLLLSLLMIKVKSSSDPRAAVHNG 
FWFFKFAAAIAI I IGAFF I PEGTFTTWFYVGMAGAFCFILIQLVLLI DFAHSWNESWVEKM 
EEGNSRCV^AALLSATALNYLLSLVAIVLFFVYYTHPASCSENKAFISVT^LLCVGASVMSI 
LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLS I IGYNTTSTVPKE GQSV 
QWWHAQGI IGLILFLLCVFYSS IRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDNERDGVTYSYSFFHFMLFIiASLYIMMTLTNWSRYEPSREMKSQWTAVWVKISSSWIGI 
VLYVWTLVAPLVLTNRDFD 
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FIGURE 42 



GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
C T CTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATC CT AGAG CTG CAGTG CACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 43 

GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 
ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGTAAT 
GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 
GTCCTTGGGGTTG^GCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 
TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 
TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAACAACT 
GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 
GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTA 
CTAATGATCAAAGTGAAGAGTAGCAGTGAT C C TAGAG CTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 
CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 
AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACATTTTTGGTTGGC 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTACT 
AATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTTCT 
TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATTCTTCATTCCAGAAGGAACTTTT 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 
CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 
ACTCGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCTGCTGTCTTTA 
GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAACAAGGC 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 
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FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 

GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAC^CGAGCGCGAGGCTTTCGGCAAAGGCA^ 

CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 

GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 

CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGAT TC ACGT CGTTTC CAGC CA 

AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 

CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 

TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 

GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 

TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATGGGCCCCCCGA 

GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 

CTGT CG CACCACCG CCTGAAAGGCAGGTTT CAGAGGGACCGCAGGAACATCCGCCCCAACAT 

CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 

CC CGGCG C AT CATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGAC CACACC CATG 

TGCTGCCCCTCACGCTCCTCCATCCTCACTGGCAAGTACGTCCACAACCACAACACCTACAC 

CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 

TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 

GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 

TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 

AC CT CACAGACCTCATCACCAATGACAG CGTGAG CTTCTT C CGC ACGTCCAAGAAGATGT AC 

CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 

CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 

CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 

GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 

GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 

CCGCCGACCACGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATAT 

GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 

TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 

TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 

CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 

GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 

ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 

CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 

CATGCGGCTGGGCGGCAGCAGAGCCCrCTCCAACCrrCGTGCCCAAGTACTACGGGCAGGGCA 

GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 

CTCTTCAAGAAGAAGTACAAGGCCAGCTATGT CCGCAGTCGCTCCATC CG CT CAGTGGC CAT 

CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 

C CAAG CGGCACTGG CCAGGGGC CC CTGAGG AC CAAG AT GACAAGG ATGGTGGGGACTTC AGT 

GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 

CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 

AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 

AGGGAAGT CCGAGGTC AC CTGAAGAAAAAG CGG CCAGAAGAATGTGACTGTCACAAAATCAG 

CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 

AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 

CGC^GCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 

CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 

GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 
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FIGURE 46B 

CTCTTCTGTGAATTTGCAACTGGCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 
GCTGATGAATGCAGTGAACACACTGGACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 
TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 
ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGTTAAGAAACAACAGAGG 
TGGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 
ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTC 
AACATGACAGATTCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 
GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAAATGCATTTTTTCGTATCAA 
AAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 
GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 
AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 
GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCTTC 
ATTGTATATATGTGACTATTTACATGTAATCAACATGGGAACTTTTAGGGGAACCTAATAAG 
AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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FIGURE 47 

MGP PS LVLCLLS ATVFSLLGGS SAFLSHHRLKGRFQRDRRN I RPN 1 1 LVLTDDQDVELGSMQ 
VMNKTRR IMEQGGAHF INAFVTTPMCCP SRS S I LTGKYVHNHNT YTNNENCS S PSWQAQHE S 
RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSD 
YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITP 
SYNYAPNPDKHWIMRYTGPMKPIHMEFTNMLQRKRLQTLMSVD^ 

YIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDI 
AGLDI PADMDGKS I LKLLDTERPVNRFHLKKKMRWRDSFLVERGKIaLHKRDNDKVDAQEEN 
FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNIiVPKY 
YGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQ 
PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANP I KVTHRCY I LENDTVQCDLDLYKS 
LQAWKDHKLH IDHE IETLQNKI KNLREVRGHLKKKRPEECDCHK I S YHTQHKGRLKHRGS SL 
HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLG 
PFCACTSANNNTYWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQL 
HVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 
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FIGURE 48 

AACAAAGTTCAGTGACTCJAUACiG 

GCTGC TGGGCAGAGAGGGACTGTCCGGCTCCCAGATGCTGGGCCTCCTGGGGAGCACAGCCC 
TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 
TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 
CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 
ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 
CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 
CGCTCGCTQAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 
TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 
GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 
AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 
GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 
ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 
ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 
GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 
AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 49 

MLGLLGSTALVGWITGAAVAVLLLLLLLATCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLG I FHHHRHPGHVSHVPNVGLHHHHHPRHTPHHLHHHHHPHRHHPRHAR 
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FIGURE SO 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCATGTCGGACCTGCTA 
CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 
GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 
CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 
TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATGGTGCC 
CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 
CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCCAGC 
CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCCGCCG 
TGTCCATCCTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 
AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 
GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 
GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 
CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 
GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 
GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 
CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCC 
ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 
CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 
TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 



ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAA 
AAAAAAAAAA 
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FIGURE SI 

MSDLLLLGLIGGLTI^ 

LFTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPEIjIDLYQKFGFKVFS 
FPAPSHVVTATFPYTTILSIWIATRRVHPALDTYIKERKLCAYPRLEiyQEDQIHFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
SRGWDDGDTRSEHSYSESGASGSSFEELI)LEGEGPLGESRLDPGTEPLGTTK^WE 
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FIGURE 52 

CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 
GCCCCGCGCCCAGTC^ACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 
GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 
GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 
GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 
GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 
GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 
GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 
AG AC C C AAAGTCTC CAAAAAGAAG CT CAAGG AAGAGAAACGAAACAAG AG CAAAAAG AAAT& 
ftTAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE 53 

HYTGSLVDGRIIDTSLTRDPLVIELGQKQVIPGLEQSLLDMCVGEKRRAI IPSHLAYGKRGF 
PPS VPADAWQ YDVEL I AL I RANYWLKLVKG I LP LVGMAMVPALLGL I GYHLYRKANRP KVS 
KKKLKEEKRNKSKKK 
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FIGURE 54 

cccgggaacgtgttcctggctgccgcacccgaacagcctgtcctggtgccccguctl:cctgc 
cccgcgcccagtcatgaccctgcgcccctcactcctcccgctccatctgctgctgctgctgc 
tgctcagtgcggcggtgtgccgggctgaggctgggctcgaaaccgaaagtcccgtccggacc 
ctccaagtggagaccctggtggagcccccagaaccatgtgccgagcccgctgcttttggaga 

CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 

CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 

CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 

AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 

TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 

ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 

CCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAG 

AATAATAAATTTTAAAAAACTTA 
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FIGURE ! 
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CCGAAAGTCCCGT rCGGACCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCC 
GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACG 
TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGA 
TTCCAGGTCTGGAGCAGAGTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATT 
CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 
GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 
GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 
CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAGAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACTTAAAA 
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FICUME 56 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 
TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 
ACCTGAAAAAAAISTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 
GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 
TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 
GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 
GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 
TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 
CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 
TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 
TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 
TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 
AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 
ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 
TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 
TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 
CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 
TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 
GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 
GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 
CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 

CACATCCACCACTG 
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FIGURE 5 7 

MSGFLEGLRCSEC I DWGEKRNT IAS I AAGVLF FT'G WW 1 1 1 DAAVI Y PTMKD FNH S YHACGV I 
ATIAFLMINAVSNGQVRGDSYSEGCLGQTGARIWLFVGFMLAFGSL IASMWI LFGGYVAKEK 
D IVYPGIAVF FQNAF I F FGGLVFKFGRTE DLWQ 
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FIGURE 58 
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TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 
CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 
TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 
TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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FIGURE 59 

TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCG CAAATACTATTGCTTCCATTG CTGCTGGTGTANTATTTTTTACAGG CTG 
GTGGATTATCATAGATG C AGNT GTTATTTATC CC ACCATGAAAGAT TT CAAC CANT CAT AC C 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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FIGURE m 

GGACACCGGGTTCCGGACCAATGCANGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 
GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 
CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
AC CCTGGAATTNCTNTATTTTTC CAGAATGCC 
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FIGURE 61 

TAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAG CG CAAT ANTATTGCTTC C 
ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTTNAACCANTCATACCATGCCTGTGGTGTTATAGCAACCATAGCC 
TTCCTAATGATTAATGCAGTATNGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTG 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTTTCATGTTGGCCTTTGGATCTN 
TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 

CCTGT 
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FUCI JRE 62 
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GGGAGGCTGTGNCCGTTTTGTTTTN'^GGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 
CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 
GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 
GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 
TATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA 
TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 
GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 
ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 
TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 
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FIGURE 63 
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CGACGCCGGCGTGATQTGGCTTCCGCTGGTGCTGCTCCTGGCTGTGCTGCTGCTGGCCGTCC 

TCTGCAAAGTTTACTTGGGACTATTCTCTGGCAGCTCCCCGAATCCTTTCTCCGAAGATGTC 

AAACGGCCCCCAGCGCCCCTGGTAACTGACAAGGAGGCCAGGAAGAAGGTTCTCAAACAAGC 

TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAATTGGCAGTGGCTTTGGGG 

GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCTGGTGCTGGAACAACAT 

ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGGAAT 

CCATTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTGGACCAGATCACTG 

AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 

AATGGCCGAAAGGAGTACCCCATGTACAGTGGAGAGAAAGCCTACATTCAGGGCCTCAAGGA 

GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATAAAGCTGGTTAAGGTGGTATCCA 

GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCCCATTGCCCGTGGTTCAGCTCCTCGAC 

AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAGCATCCACCCAGAGCCTGGCTGA 

GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACTCAGCTACATCTTCCCCA 

CTTACGGTGTCACCCCCAACCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCAACCACTAC 

ATGAAAGGAGGCTTTTATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCACACCATCCCTGT 

GATTCAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTGTGCAGAGTGTGTTGCTGGACT 

CAGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCTGGTGAACATCTATTGC 

CCCATCGTGGTCTCCAACGCAGGACTGTTCAACACCTATGAACACCTACTGCCGGGGAACGC 

CCGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 

CTGTTTTCATCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAACTACTAT 

GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATGCCCAGGGAAGA 

GGCTGCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 

AGGACCGATTCCCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 

GAGGAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 

CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 

AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 

TGCTACGGGGCTGACCATGACCTGGGCCGCCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 

CCAGAGCCCCATCCCCAACCTCTATCTGACAGGCCAGGATATCTTCACCTGTGGACTGGTCG 

GGGCCCTGCAAGGTGCCCTGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 

CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATTAGTTCCATCAGGGAGG 

AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 

CATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 

TTTAAATCACAATTCCGAATCTGGGGCAATGGAATCACTGCTTCCAGCTGGGGCAGGTGAGA 

TCTTTACGCCTTTTATAACATGCCATCCCTACTAATAGGATATTGACTTGGATAGCTTGATG 

TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAGCGA 

ATATTCCATCTGTGGATAGAACCCCTGGCAGTGTTGTCAGCTCAACCTGGTGGGTTCAGTTC 

TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 

GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCGTGGTTTTGGTTGCCATTCCATA 

GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 

GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 

GGGTGCATGTCAGATGATCATATCCAATTCATATGGAAGTCCCGGGTCTGTCTTCCTTATCA 

TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAGCCTCAATCAAGC 

CTTATCCACCAAATACACAGGGAAGGGTGATG CAGGGAAGGGTGAC ATCAGGAGTCAGGG CA 

TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAGGGCATTCCAGCCAAGGG 

CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAAGTCACATCAGAAAAGGGA 

AAGCCACGGAATGTGTGTGAAG C C CAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGGG 

TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 

GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGTTAATGTAACATTAATAAATTTATTG 

ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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FIGURE 64 

MWLPLVLLLAVLLIiAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 
QVPEKLDVWIGSGFGGLAAAAILAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIG 
RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 
EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 
LGASSELQAVLSYIFPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTIPVIQRA 
GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCL? 
GVKQQLGTVRPGLGMTSVF I CLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 
IPLLFFAFPSAKDPTWEDRFPGRSTMIMLIPTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 
SMSVVLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACYGADHDLGRLHPCVMASLRAQSPI 
PNLYLTGQDI FTCGLVGALQGALLCSS AI LKRNLYSDLKNLDSR I RAQKKKN 
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FIGURE 6S 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 
GGGGTTGGCACCGGCCCCGAGAGGAGG&TgCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 
TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 
TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 
GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAAT C TGAATT AGAATC CT CTATT CAAGA 
AGAGGAAGACAG CCTCAAGAG C CAAGAGGGGGAAAGTGTC ACAGAAGATATCAGCTTTCTAG 
AGTCTC CAAATC CAGAAAACAAGG ACTATGAAGAGC CAAAGAAAGTACGGAAACCAG CT TTG 
ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTTTTCCTAGATAA 
GGAGTATGAT GAAT GT AC ATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTACAAC CT 
ATGACTAC AAAGCAGATGAAAAGTGGGG CTTTTGTGAAACTGAAGAAG AGG CTG CTAAGAGA 
CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 
TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 

CAGGCAGCGAGAGAGATGTTTGAGAAG CTG AC TG AGGAAGGCTCTC C CAAGGGACAGA CT GC 
TCTTGGCTTTCTGTATGCCrCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 
ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 

J&gTGGAAGGCTAATAATATTAACAT CAGAAGAATTTGTGGT TTATAG CGGCCACAACTTTT 
TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAA 
ATTCTTGTTAATGG AT ATAACACATGGAAT CT AC ATGT AAATGAAAGTTGGTGGAGT C CACA 
ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 
TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 
TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 
GAAATATTATACAATATTTTGCAACAATGCCCrrAAGAATTGTTAAAATTCATGGAGTTATTT 
GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 
C(^TTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 
CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATTAACTCATTTTTAATAAA 
ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEESELESS IQEEEDSLK3QEGESVTED I S FLES PNPENKDYEE PKKVRKPALTAI EGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNLIAHMVLVSRL 
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FITGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 



GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACTACGGTGTGTCCTGG 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 
TACGGCTTTAGTCCCT&QGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 
GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
TTAATAATATTCAACATGTCAACAAC 
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FUGURE m 

MACRCLSFLLMGTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQUKACi 
SAPRYLLYYRSEEDHHRPADI PDRFSAAKDEAHNACVLT I SPVQPEDDADYYCSVGYGFS P 
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FIGURE 69 

GCCGCCCCGCCCCGAGACCGGGCCCGGGGGCGCGGGGCGGCGGGATGCGGCGCCCGGGGCGG 
CGATGACCGCGGAGCGCACGCCGCGGGCCCGGCCCTGACCCCGCCGCCCGCCCGCTGAGCCC 
CCCGCCGAGGTCCGGACAGGCCGAGATGACGCCGAGCCCCCTGTTGCTGCTCCTGCTGCCGC 
CGCTGCTGCTGGGGGCCTTCCCACCGGCCGCCGCCGCCCGAGGCCCCCCAAAGATGGCGGAC 
AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTGTGCGGCTGCAGTGCCCAGTGGA 
GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCGCACCATCCACAGCGGCTGGA 
GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATGCCGGC 
GTGTACGTGTGCAAGGCCACCAACGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGT 
GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTGGGGGTCAAG 
AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGC 
CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCC 
TCGGCCCGACATCACGTGGATGAAGGACGAC CAGGCCTTGACGCGC C CAGAGG C CGCTGAG C 
CC AGGAAGAAGAAGTGGACACTGAGCCTGAAG AACCTG CGG C CGGAGGACAGCGGCAAATAC 
ACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCTACAAGGTGGATGTGATCCA 
GCGGACCCGTTCCAAGCCCGTGCTCACAGGCACGCACCCCGTGAACACGACGGTGGACTTCG 
GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGGCTG 
AAGCGCGTGGAGTACGGCGCCGAGGGCCGCCACAACTCCACCATCGATGTGGGCGGCCAGAA 
GTTTGTGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGC 
TGCTCATCACCCGTGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACC 
ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCC 
ACCTGTGGCCTCCTCGTCCTCGGCCACTAGCCTGCCGTGGCCCGTGGTCATCGGCATCCCAG 
CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCCAGGCCCAGAAGAAGCCG 
TGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGGGACGGCCCGCGACCG 
CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTGGGGCTGT 
GTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACTGGGCCCAGGCCCAGTTGCTGGC 
CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACACACACACACTCTCACAC 
ACACTCACACGTGGAGGGCAAGGTCCACCAGCACATCCACTATCAGTGCTAGACGGCACCGT 
ATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGCT 
GCAGACGAAGGCAGGGGACCCATGGCGAGGAGGAATGGCCAGCACCCCAGGCAGTCTGTGTG 
TGAGGCATAGCCCCTGGACACACACACACAGACACACACACTACCTGGATGCATGTATGCAC 
ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACGCACATGCACAGATATG 
CCGCCTGGGCACACAGATAAGCTGC CCAAATG CACG C ACACGC ACAGAGACATGC CAGAACA 
TACAAGGACATGCTGCCTGAACATACACACGCACACCCATGCGCAGATGTGCTGCCTGGACA 
CACACACACACACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATCCGGACACA 
CACGTGCACAGATATGCTGCCTGGACACACAGATAATGCTGCCTTGACACACACATGCACGG 
ATATTGCCTGGACACACACACACACACACGCGTGCACAGATATGCTGTCTGGACACGCACAC 
ACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCACAGGCGCAGATATGCT 
GCCTGGACACACGCAGATATGCTGTCTAGTCACACACACACGCAGACATGCTGTCCGGACAC 
ACACACGCATGC ACAGATATGCTGTC CGGACACACACACG CACG CAGATATGCTG CCTGGAC 
ACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGATATTGCCTGGACACACACA 
TGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATATGCTGTCCGGATACACACG 
CACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGACACACATGCACACACAGGT 
GCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTCAACACTCACACACGTGCAGA 
TATTGCCTGGACACACACATGTGCACAGATATGCTGTCTGGACATGCACACACGTGCAGATA 
TGCTGTCCGGATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGA 
CACACATGCACACACAGGTGCAGATATGCTGCCTGGACACACGCAGACTGACGTGCTTTTGG 
GAGGGTGTGCCGTGAAGCCTGCAGTACGTGTG CCGTGAGG CT CATAGTTGATGAGGGACTTT 
CCCTGCTCCACCGTCACTCCCCCAACTCTGCCCGCCTCTGTCCCCGCCTCAGTCCCCGCCTC 
CATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGG 
AGTCCCCTACTGCTGTGGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCCTGAGAGGCTGGAG 
CCCATGGCTAGTGGCTCATCCCCAGTGCATTCTCCCCCTGACACAGAGAAGGGGCCTTGGTA 
TTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTGGGTTGCAGGGAC 
TGTGGTCTCTCCTGGGGCCCGGGACCCGCCTGGTCTTTCAGCCATGCTGATGACCACACCCC 
GTCCAGGCCAGACACCACCCCCCACCCCACTGTCGTGGTGGCCCCAGATCTCTGTAATTTTA 
TGTAGAGTTTGAGCTGAAGCCCCGTATATTTAATTTATTTTGTTAAACACAAAA 
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MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKWPRQVARLGRTVRLQCPVEGDPPPLTM 
WT KDGRT I H S GWS RFRVL P QGL KVKQ V E RE DAGV YVCKATNG FG S L S VNY T L WL D D I S P G K 
ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 
DDQALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 
TGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKFWLPTGD 
VWSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 
TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 
LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 
HQHIHYQC 
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FIGURE 71A 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCG 
CATTCC/.GAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCTTGCCACGTTCCCACGGGC 
TTGGGGGAAAGATOGTGGGGACCAAGGCCTGGGTGTTCTCCTTCCTGGTCCTGGAAGTCACA 
TCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTAAGAAGAGTCCAGCCTGGGAAGAA 
GAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAGTGGACAACAT 
GGTTCAACATCGACTACCCAGGCGGGAAGGGCGACTATGAGCGGCTGGACGCCATTCGCTTC 
TACTATGGGGACCGTGTATGTGCCCGTCCCCTGCGGCTAGAGGCTCGGACCACTGACTGGAC 
ACCTGCGGGCAGCACTGGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCA 
ACAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGTACGCTTCCTCTGCCCACCA 
GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAAGTGCTC 
AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTTGGCAGAGATGGTGT 
CGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGC ACTG CATGGGCCAGGACTGT AC AG C C TGT 
GACCTGACCTGCCCAATGGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTGCCAGGACTT 
CATGCTTCATGGGGCTGTCTCCCTTCCCGGAGGTGCCCCAGCCTCAGGGGCTGCTATCTACC 
TCCTGACCAAGACGCCGAAGCTGCTGACCCAGACAGACAGTGATGGGAGATTCCGAATCCCT 
GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTCAAGTTTGCCCCCATTGT 
ACTCACAATGCCCAAGACTAGCCTGAAGGCAGCCACCATCAAGGCAGAGTTTGTGAGGGCAG 
AGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCT 
CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCATAATGACAC 
ATTGCTGGATCCTTCCCTCTACAAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGC 
ACCAGGCTGGGGAGTACTTTTGCAAGGCCCAGAGTGATGCTGGGGCTGTGAAGTCCAAGGTT 
GCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTTCCTGAGAGCTATCT 
TATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGAC 
GCTGCCCTGTTAAGACTTGTGCAGGGCAGCAGGATAATGGGATCAGGTGCCGTGATGCTGTG 
CAGAACTGCTGTGGCATCTC CAAGACAGAGGAAAGG GAGATC C AGTGC AGTGGCTACACGCT 
ACCCACCAAGGTGGCCAAGGAGTGCAGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGC 
GGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATGCGCTTTGGCCATGTGTACATGGGG 
AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCCAGGACAC 
TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGC 
TACCTTTCAACAAGAAGGGGAGTG CCGTGTTCCATGAAATCAAGATGCTT CGTCGGAAAGAG 
CCCATCACTTTGGAAGCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTGAAGA 
C CC CATGG CTGAACTGGAGATTC CATC CAGGAGTT T CTACAGG CAGAATGGGGAGCCCTACA 
TAGGAAAAGTGAAGGCCAGTGTGACCTTCCTGGATCCCCGGAATATTTCCACAGCCACAGCT 
GCCCAGACTGACCTGAACTTCATCAATGACGAAGGAGACACTTTCCCCCTTCGGACGTATGG 
CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCACTTAATGCTGGCAAAGTGA 
AGGTCCACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGG 
TCACTCAATCCAGACACAGGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATCAAAG 
GAGGAACAAAAGAGAAGACAGAACCTTCCTGGTGGGCAACCTGGAGATTCGTGAGAGGAGGC 
TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAGGTGAGGGCCTACCGGAGT 
GAGAGGTTCTTGCCTAGTGAGCAGATCCAGGGGGTTGTGATCTCCGTGATTAACCTGGAGCC 
TAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCC 
CCAACGGGGCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTAT 
GTCTTGGCAAGCCTGGCTGGGGAGG AACTGCAAGCAGTGGAGT CTT CTC CT AAATTCAAC CC 
AAATGCAATTGGCGTCCCTCAGCCCTATCTCAACAAGCTCAACTACCGTCGGACGGACCATG 
AGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAAGCCAAGGCCCAACTCA 
GCTGAGGAGAGCAATGGGCCCATCTATGCCTTTGAGAACCTCCGGGCATGTGAAGAGGCACC 
ACCCAGTGCAGCCCACTTCCGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAG 
TCCCCTTCAACGAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGGCCAAAG 
CCGATGGAATTCAGGGCCTGCTATATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGT 
GCGATCCCGCAACATGGGGGGCACTCATCGGCGGACAGTGGGGAAGCTGTATGGAATCCGAG 
ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGT CT CAGCTGCCTGTCTGGAGTTCAAG 
TGCAGTGGGATGCTCTATGATCAGGACCGTGTGGACCGCACCCTGGTGAAGGTCATCCCCCA 
GGGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACCACTTGC 
CACTTGCAGTCAACAACGACACCAGTGAGTACACCATGCTGGCACCCTTGGACCCACTGGGC 
CACAACTATGGCATCTACACTGTCACTGACCAGGACCCTCGCACGGCCAAGGAGATCGCGCT 
CGGCCGGTGCTTTGATGGCAGATCCGATGGCTCCTCCAGAATCATGAAGAGCAATGTGGGAG 
TAGCCCTCACCTTCAACTGTGTAGAGAGGCAAGTAGGCCGCCAGAGTGCCTTCCAGTACCTC 
CAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAG 
GCAGCAG CGAG CGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTGGCCTCTCTGAGATTTC 
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CTAGAGTTGCTCAACAGCCCCTGATCAACIMGTTTTGTGGTACTTCACCCTCTTCTGCCCT 
CATTTCATGTGACAGCCATTGTGAGACTGATGCACAAACTGTCACTTGGTTAATTTAAGCAC 
TTCTGTTTTCGTGAATTTGCTTGTTTGTTTCTTCATGCCTTTACTTACTTTGTCCCATGCTA 
CTGATTGGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTCTTTAAA 
TGAAACACAAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATGC 
CATTAATGCAAATATACTTCCTCTTCTTTTTGCATGGTTTTGCCCACCTCTGCAATAGTGAT 
AATCTGATGCTGAAGATCAAATAACCAATATAAAGCATATTTCTTGGCCTTGCTCCACAGGA 
CATAGGCAAGCCTTGATCATAGTTCATACATATAAATGGTGGTGAAATAAAGAAATAAAACA 
CAATACTTTTACTTGAAATGTAAATAACTTATTTATTTCTTTGCTAAATTTGGAATTCTAGT 
GCACATTCAAAGTTAAGCTATTAAATATAGGGTGATCATAGTTCCTCTACCAAGTCTGGAAA 
GAACATCTCCTGGTATCCACAATTACACCAGGTTGCTAACTGTATTTGTACATTTCCCTTTG 
CATTCGCTTTTGTTCTTGCTAGAAACCCAGTGTAGCCCAGGGCAGATGTCAATAAATGCATA 
CTCTGTATTTCGAAAAAA 
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MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNI 
DYPGGKGDYERLDAIRFTYGDRVCARPLRLEARTTDWTPA 

RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFRIPGLCPDGKSILKITKVKFAPIVLTMPKTSLKAATIKAEFVRAETPY 
MVMNPETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAG 
EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 
KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 
SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFN 
KKGS AVFHE I KMLRRKE P I TLEAMETNI I PLGEWGED PMAE LE I PSRS F YRQNGE PY I GKV 
KASVTFIJDPRNI STATAAQTDUSHF ^ 

DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CVPAFCDDQS PDAYS AYVLASLAGEELQ AVES S P KFNPNA I G VP QP YLNKLNYRRTDHEDPR 
VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 
EDDP^ WTEDYLAWWPKPMEFRAC YI KVKI VG PLEV1A/RSRNMGGTHRRTVGKLYG IRDVRS 
TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 
NNDTSEYTMLAPLDPLGHNYGIYTVTDQDPRTAKEIALGRCFDGTSIXjSSRIMKSNVGVALT 
FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 
CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 
CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 
ATATGTAAACTAGACT^AAGATCGCAGATCATAAAGCAAGCTCTGCTTTAGTTTCCAAGAAGA 
TTACAAAGAATTTAGAG ATG TATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 
GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 
GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 
TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 
ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 
GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCACATTTTGGC 
AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCTGTCTTGGAGC 
AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACCAAAT 
GATCCTGGAGAAGTCTCTCGATTATGGACGAACATGGCAGCCCTATCAGTATTATGCCACAG 
ACTGCTTAGATGCTTTTCACATGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 
TTAGAAATCATTTGCACAGAAGAGTACTCAACAGGGTATACAACAAATAGCAAAATAATCCA 
CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 
TCTACGGACAGCTGGATACAACCAAGAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 
ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 
CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 
TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 
TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 
CCCCAAAGGCACTGCAAATACCTGTATCCCCAGTATTTCCAGTATTGGTACGAATGTCTGCG 
ACAACGAGCTCCTGCACTGCCAGAACGGAGGGACGTGCCACAACAACGTGCGCTGCCTGTGC 
CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 
CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 
TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 
GTGCCGTGGGGAAGCAGACACAACCCAAACATTTGCTACTAACATAGGAAACACACACATAC 
AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 
CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 
TGTTGATATTATCACTGCAAATCACATTGCCAGCTGCAGAGCATATTGTGGATTGGAAAGGC 
TG CG ACAG C C CC CC AAAC AGG AAAGACAAAAAACAAAC AAAT CAAC CG AC CT AAAAAC ATTG 
GCTACTCTAGCGTGGTGCGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 
TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 
GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 
GG TTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAAC AG CCCCCTCTAAAAGC 
GCAAGCCAGTCATACCCCTGTATATCTTAGCAGCACTGAGTCCAGTGCGAGCACACACCCAC 
TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAAATGAAGTT 
ATTTTTCTTGAACTACTGTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTTACCAGA 
CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 
TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 
GAATATCAGTTTACATATATAACAAGTGTAATAAGATT CCAC CAAAGGACATTCTAAATGTT 
TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 
AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAGCCACCAAGCAGTTTCACACTCACTTT 
ACTGATTT CTGTGTGG AC TG AG TACATTCAG CTG ACGAATTTAGTTC C CAGGAAGATGGATT 
GATGTTCACTAGCTTGGACAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 
CAAC AG CAAAAAAAAAAAAAAAAAAAAAA 
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MYLS RS LS I HALWVTVSS VMQ P YPL VWGH YDL CKTQ I YTE EGKVWD YMACQ PES TDMTKYLK 
VKLDPPDITCGDPPETFCAMGNPY^ 

EYPKPLQVNITLSWSKTIELTDNIVITFESGRPDQMILEKSLDYGRTWQPYQYYATDCLDAF 
HMDPKSVKDLSQHTVLEI ICTEEYSTGYTTNSKI IHFEIKDRFALFAGPRLRNMASLYGQLD 
TTKKLRDFFTVTDLRIRLLRPAVGEIFVDELHLARYFYAISDIKVRGRCKCNLHATVCVYDN 
SKLTCECEHNTTGPDCGKCKKNYQGRPWSPGSYLPIPKGTANTCIPSISSIGTNVCDNELLH 
CQNGGTCHNNVRCLCPAAYTGILCEKLRCEEAGSCGSDSGQGAPPHGTPALLLLTTLLGTAS 
PLVF 
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CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 
CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 
CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 
CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 
AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 
AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 
AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 
CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 
AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 
TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 
ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 
CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 
CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 
TGGACCAAACCTCGIS&GCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 
CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 
AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 
ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 
TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 
GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 
TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 
TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 
TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAACTCAGTAGTACAACC 
TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 
AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAAAAAAAAAAAA 
AAAAG 
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MSVIFFACWRVRDGLPLSASTDFYHTQDFLEWRRRLKSLALRLAQYPGRGSAEGCDFSIHF 
SSFGDVACMAI CSCQCPAAMAFGFLETLWWEFTAS YDTTC I GLASRPYAFLE FDS I IQKVKW 
HFNYVSS SQMECSIjEKI QEELKLQP P AVLTLEDTDVANGVMNGHTPMHLE PAPNFRME PVTA 
LGILSLILNI MCAALNL I RGVHLAEHSLQDP RS WFCWLDQTS 
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FIGURE 77 

TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCC^TACGCTTTTCTTGAGTTTGACAGCATCATTCAGAAAGTGAAGTGGCATT 
TTAACTATGTAAGTTCCTNTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 
CC CGAC CTTAAAGAGTGGGGAGCAAAGGGAGGACAGAG C C CTTTAAAACGAGGCGGGTGGTG 
CCTGCCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTT 
TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 
CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 
TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 
GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 
CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 
TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 
TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 
CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 
TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 
GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 
AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAGAACAAGGCCAACAGAGAGTCACTCT 
ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 
CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 
AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 
CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 
CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 
TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 
GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 
CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 
TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 
GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 
GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 
GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 
TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 
ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 
CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCCAGCTGGGGGTGGGA 
AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTTGG 
ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 
GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 
CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 
ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 
AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGC CAGGG CAGG CTCGGGAGATAGATTGTCT C C 
CTTGCCTCTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 
AAAGACCAAGGGGATAGGGAGGAGGAGGCTTC AG C CAT CAGC AATAAAGTTGATCC CAGGGA 
AAAAAA 
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FIGURE 79 

MfiAPDyBvLSVRliUb^HBHiKECi ISTbL. FATLY li^CHlFljTKFKKPAhiFTTVDJJEDAT VNK 
IALELCTFTLAIALGAVLLLPFSI ISNEVLLSLPRNY\ : IQWLNGSLIHGLWNLVFLFPNLSL 
IFIJ4PFAYFFTESEGFAGSRKGVLGRVYETVVMLMLLTLLVLG 

YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 
AALTRRICNPTSCWLPLDMELLHRQVLALQTQRVLLEKRRKASAWQRNLGYPLAMLCLLVLT 
GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQIIGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 

ACCTGACTACGAAGTG CTAT CCGTGCGAGAACAGCTATTC CACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 
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FIGURE M 

GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 
CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 
TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 
CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 
AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 
GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 
TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 
GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
C CAT CAT CAG CAATGAGGTGCTGCACT C C C 
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FIGURE 82 

GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTT 
GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 
GGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 
ATTTTCCTCCAGCCTCCATGGACCACAGTCTTCCAAGGAGAGAGAGTGACCCTCACTTGCAA 
GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 
TACTAAG AGAAACCCCAGACAATAT CCTTGAGGT TC AGGAAT CTGGAGAGTACAGATGCCAG 
GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 
TCATGCTGCCCAGGCTAATGTTGAACTCCTGGGCTCAAGTGATCTGCTCACCTAGGCCTCTC 
AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 
TCTGTGGTTCTGAGGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATTTACAAGAA 
TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSCXJFARTPRPilf 

GKEILRETPDNILEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 
GAGGCGGCCTCACAGGGCCGGGTGGGCl'GGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 
GAGTGTGTGGAACAGGACCCGGGACAGAGGAACC ATGG CTCCGCAGAACCTGAGCACCTTTT 
GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 
GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 
GCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTG 
CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 
GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 
TGGTTTCATGTTTGGAGGAACC C CTCGT CAGCAAGACAGAAATATT C CAAGAGGAAGTGATA 
TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 
AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 
GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 
GC CCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGC CTGGGGTG 
AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 
AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATT 
TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 
CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 
ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 
TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 
CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATATTGAGAGTG 
AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 
TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 
TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 
AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 
TT CAAGAATT AAAGCTGCAAGAGGACTCCAGGAG CAAAAGAAAC ACAATATAGAGGGTTGGA 
GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 
TTATTTTTA 
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FIGURE 85 

MAPQNLSTFCLLLLYLIGAVI AGRDFYKILGVPRSAS I KD IKXAYRKLALQIjHPDRNPDDPy 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQS SHGD I FSHFFGDFGFMFGGTPRQQ 
DRNI PRGSDI I VDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKVVKH 
PIFE RRGDDL YTNVT I SLVESL VGFEMD I THLDGHKVH I S RDK I TRPGAKLWKKGEGLPNFD 
NNN I KGSL 1 1 TFDVDF PKEQLTE EAREG I KQLL KQGS VQKVYNGLQGY 
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FIGURE 86 

TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCGAA 
.. ,GCGGCGGGCGGAGGAGGTTTTGAG(^TTTTTGGAACA 

CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 
ACGAGATTTNTATAAGATTTTGGGGTG C CTNGAAGT GC CTTNTATAAAGGATATTAAAAAGG 
CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCACAAGCCCAG 
GAGAAATTC CAGGATTTGGGTG CTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 
GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNAT CAGAG CTC C CATGGAGACATT T 
TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGACAGA 
AATATTCCAAGAG 
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GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 
CTCTGCAGGTGTCGTGGAGGAACCTAGCACCTGCCATCCTCTTCCCCAATTTGCCACTTCCA 
GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 
AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGC CACC AT CTTTCTGGCTTCGTTTGCAGCC 
TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 
TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 
TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 
GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 
AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 
GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 
TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 
TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 
AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 
CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATTTAGTGCCT 
ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 
CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 
TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 
GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 
CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 
AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 
TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 
TTCTTTTGGCAAGACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 
GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 
ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 
GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE 88 

METWIVAIGVLATIFLASFAALVLVCRQRYCRPRDIJjQRYDSKPIVDLIGAMETQSEPSE^ 

VSDIIWAKRISPRVDDWKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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FIGURE 89 



GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 
CCACCACTGCAGCC ATGA TCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAACAGGA 
TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 
TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 
TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 
CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTCTCTTGTTCAG 
GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAC^GTGCCAGTCCTTGGATCCCTCCTAAAT 
TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTAT&ACAACA 
AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 
GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 
CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 
ACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 
ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTCAGCCATCCTTAGAGAA 
CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 
TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 
TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 
ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
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FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFWL I GWPL IGMI FE I YGFFLLFRGFFPWVGF IRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 

Important features: 
Transmembrane domains: 

amino acids 12-30 (typell), 33-52, 69-89 and 93-109 

N-myristoylation sites. 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class -II protein. 

amino acids 4 9-59 
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FIGURE 91 



GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTC CCGGCTTCATTTCTC CCGACTC AG CTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 
CGGAC ACGCAGAAAATTGGAATGGGATTAACCGGATTTGG AG TGTTTTTCCTGTTCTTTGGA 
ATGATTCTCTTTTTTGACAAAGCACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 
GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 
CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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FUGURE 92 

GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 
GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 
CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 
TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 
ATC^CACCATTCAGCTCTTCACTCTCCTCCTCTGGCCCATTAACAAGCAGCTCTTCCGGAA 
GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 
CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 
GC CAT CGTGGTTCTCAACCACAAGTTTGAAATTG AC TTTCTGTGTGGCTGGAGCCTGTCCGA 
ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 
TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 
CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 
GATT CACTGTGAGGGCACACGGTTCACGGAGAAGAAGCAT GAGATC AG CATG CAGGTGGC C C 
GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 
ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 
TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 
TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 
AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 
GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGC 
TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 
CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 
GACG GAAATTGACAAGGG CT CTGC CTACGG CAACT CTGACAGCAAGCAGAAACTGAATGACT 
3ACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCT 
CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGA 
CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 
TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 
TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAG 
GGCAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 
AACC CTTGGTTGC CAGAGATAAAGTGAAAAGTGCTTTAGG TG AG AT GACT AAATTATGCCT C 
CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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MDLAGLLKSQFLCHLVFCYVFIASGLI INTIQLFTLLLWPINKQLFRKINCRLSYCISSQLV 
MLLEWWSGTECTIFTDPRAYLKYGKENAIVVLNHKFEIDFLCGWSLSERFGLLGGSKVLAKK 
ELAYVPIIGWMV^FTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHE 
ISMQVARAKGLPRLKHHLLPRTKGFAITVRSLRNWSAVYDCTLNFRNNENPTLLGVLNGKK 
YHADLYVRRIPLEDIPEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 
WLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 
KQKLND 
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IFIfCURK 94 

CTGAGGCGGCGGTAGC&TgGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 
GCGCACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAA 
GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 
TACAATTGACATT CAGAAATATATTC CATG CTAT CAGC TT TTTAGCTT TT AT AATT CTT C AG 
GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 
TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 
AAACTTGCAGGAGCATTTTT CAAAC CAAGAC CTTGTTT TT CTGCTATTAACACCAAGTATAA 
TAACAGAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 
TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 
TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 
T TTTTGAAGAAGATGGAT CCTTAAAGGAGGTACATAAGATAAATGAAATGTATG CTTCATTA 
CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 
AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAGCACAGATTCAGG 
CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTTTCTTTGTCAGGCATTA 

TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 
TGGTAGAACACACTGACATTCCTG AAGCTAGT CCAG CTAGTA CAC CACAAAT CATTAAGCAT 
AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 
CAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 
CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 
TTTTQATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 
TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 
CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 
ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 
TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACC AGG C AC AGTGG CT C ACAC CTGTAA 
TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 
TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 
TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 
TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 
ATAAATTTGCAAAACATCATCTAAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 95 

MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQ 
KYIPCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 
RLKREIEKRRGAQIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSS 
CNYNHHLDWDNLTLMVEHTDIPEAS PASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
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FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 
GCGCAGGGCAGCCC^GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 
CCGCGGCGCCTGCC^CCCTTCCCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 
CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 
CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 
GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 
ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 
GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 
CCA^GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 
CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 
CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 
AGGGTT CTACCTGCTGCTCT CAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT 
TTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTGCAAGCTGTCTTTGCTTCACGTTACAA 
GAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAAT C CC TGAATGATATGT 
TTGTGAAGACATATGGCCATTTATACATGCAAAATT CTGAGCTATTTAAAGATC TC TTCGTA 
GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 
GGCTCGCCTCCTGGAGCGGATGTTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGT 
AT CTGGAATGTGTGAGCAAGTATACGGAGCAG CTGAAGCC CTTCGGAGATGT CC CT CG CAAA 
TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 
TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 
TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAAC 
TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 
CAATTTCATAGATGCTATGCTGATGGTGGCAGAG AGGC TAGAGGGT CCTTTCAACATTGAAT 
CGGTCATGGATC CCAT CGATGTGAAG AT TT CTG ATG CTATTATGAACATGCAGGATAATAGT 
GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 
AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 
AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 
CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 
GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 
TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 
AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGAT 
GAAGAATGCATACAATGGGAACGACGTGGACTTCTTTGATATCAGTGATGAAAGTAGTGGAG 
AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 
GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 
GGCCTACCTCCTC^CTGTCTTCTGC^TCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAAT 
TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 
CC^TCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 
CACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCA 

CATTGAGTTGGTTCCTGCTC CC CCAAACCATGTT AAACGT GG CTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 
ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTT^ 

GGTCCCTT CTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAG CA 
GGTTTTATTTATCATGTT AT CTTATTAAAAGAAAAAGCCCAAAAAGC ^ ' 
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FIGURE 91 

MARPGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQ 
GSTCCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 
VKTYGHLYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEY 
LECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDVVSKVSVVNPTAQCTHAL 
LKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIES 
VMDP IDVKI SDAIMNMQDNS VQVSQKVFQGCGPPKPLPAGR I SRS I SESAFSARFRPHHPEE 
RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLF 
AVTGNGLANQGNNP EVQVDTSKPD I L I LRQ I MALRVMTSKMKNAYNGNDVDFFD I S DE SS GE 
GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 
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FIGURE 9$ 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 
GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 
GCAACTTACAGCTGCACCGACAGTTGCGATSAAAGTTCTAATCTCTTCCCTCCTCCTGTTGC 
TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 
AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAA 
AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGT CTGGGCTGC CAAAGAAGC 
AGTG CC C CTGTGAT CATTTCAAGGGCAATGTG AAGAAAAC AAGACACCAAAGGCAC CACAGA 
AAGC CAAACAAGCATTCCAGAGC C TG CCAG C AATTTCTCAAACAATGT CAGCTAAGAAGCTT 
TGCTCTGCCTTTGTAGGAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 
GACAGTGAGCACACCTACCAGACACTCTTCTTCTCCCACCTCACTCTCCCACTGTACCCACC 
CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 
TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 
AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 
AAT CAGGAAAGTAGCAAAC AGAAGT CAATAAAT ATTTTTAAATGT CAAAAAAAAAAAAAAAAAA 
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F I G U R E 99 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRR 
KFMTVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 
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FIGURE 100 

AATGGCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAG CCAGACGACT CGGGCAAAGACC CA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTS^CATCCTCAGGACACACCCATGTGGCTCCTGGACAATC CAAGAG CA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Wl 



MAVLVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENAVE 
FIIiRSMSRSTGFMEFDDNEGKHSSK 
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FIGURE 102 

GG ACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGC AC AG CT 
CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGGTGCTTCTTAC 
CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 
CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 
CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 
CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 
CAAATC CC CACTTTGAGAAGTT CC TGACAAAG AGCATGGCTGAGAACAGGCACCTCCAATAT 
GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGT 
GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 
GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 
AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 
CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 
AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 
AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 
AGCCACC CAC CAGC CTAT CTATCTTCCACTGAGAGGG A C CTAGC AGAATGAGAGAAGACATT 
CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 
CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 
TGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 
CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 
CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 
CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 
GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 
AACCACG 
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— ^\ off/St o 
FIGURE 103 

MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQI 
KGLTGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFW 
APGEDMRQLADGSMDVVVCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFM 
WQQVFEPTWKH I GDGCCLTRETWKDLENAQFSE I QMERQ PP P LKWLPVGPH I MGKAVKQS FP 
SSKALICSFPSLQLEQATHQPIYLPLRGT 
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FIGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGGAGG 
CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAtVA. 
ACTAATATTTAT ATGACAGAAGAAAAAGATGTCATT C CGTAAAGTAAACAT CATCATCTTGG 
TC CTGGCTGTTGCTCTCTTCTTACTGGTTTTGCA.CC AT AACTTCCTCAGCTTGAGCAGTTTG 
TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 
TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 
AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 
GTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCCGGTCCTGGCTCAACAG 
TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 
AAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTAC 
TTGCCAATTCTGGT TC CC AG CG CAAAGAAGGCCATATACATGGATG ATGATGTAATTGTG CA 
AGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAG 
AAGATTGTGATTCAGCCTCTACTAAAGTTGTCAT CCGTGGAG CAGGAAACCAGTACAATTAC 
ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 
CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 
ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 
GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 
TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 
AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 
ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 
ATATACCGAGAT CT CAAACATAAAG TGAA ACAGAATTTGAACTGTAAGCAAG CATTTCTCAG 
GAAGTC CTGG AAGATAGCATGCATGGGAAGTAACAGTTG CTAGGCTTCAATG C CTATCGGTA 
GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 
AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 
ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 
TAATTCAAAACTGCTGTTGGTTTTAATTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 
ACATTTTTC 
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FIGURE WB 



MSFRKVNIIILVLAVALFLLVLKHNFLSLSSLLRNEVTDSGIVGPQPIDFVPNALRHAVDGR 
QEEI PWIAASEpRLGGAI AAINS IQHNTRSNVI FYI VTLNNTADHLRSWLNSDSLKS IRYK 
IVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAI YMDDDVIVQGDILALYNT 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
NLTEWKRQNITNQLEKWMKLISFVEEGIiYSRTLAGSITTPPLLIVFYQQHSTIDPMWNVRHLGS 
SAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDVWEKWYIPDPTGKFNLIRRYTEISNIK 
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FIGURE 1(06 

TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 

CAGGAATTNTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCTCCG 

GGGAGACAAGAGGAGATT CC TGTGGT CATCGCTG CATNTG AAGACAGG CTTGGGGGGGC C AT 

TGCAGCTATAAACAGCATT CAG CACAAC AC TCGNTC CAATGTGATT TT CT ACATTGTTACTC 

TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 

TACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCA 

GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 

CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 

AATACAGCACTGAAGC CAGGACATGCAGCTG CATTTTC AGAAGATTGTGATT CAG C CT CTAC 

TAAAGTTGTCATCCGTGGAGCAGGAAA 
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FIGURE 107 

CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 
GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGT3CAGGGAACGCGGGCGGCCAGACAACGGGC 
TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 
TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 
TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 
TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 
CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 
CATAACTT CCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATT CAGGAATTGTAG 
GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 
GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 
ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 
GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 
TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 
ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 
CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 
CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 
CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 
TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 
CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 
AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 
TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 
GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 
TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 
GACCCAACAGG CAAATTC AAC CTAATCCGAAG AT ATAC CGAGAT CT CAAACATAAAGTGAAA 
CAGAATTTGAACTGTAAGCAAGCATTTCT CAGGAAGT C CTGG AAGATAG CATG CGTGGGAAG 
TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 
GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 
ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 
CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 
GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAAAAAAAAAAAAA 
AAAAAA 
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FIGIME 108 



PCT/US99/12252 



CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 
GAAGGCCGGCCATACCAGAGTCGTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 
TCTGTGCTGGTCTGAGGGTGCTGCCTGTC&IgGGGGCAGCCATCTCCCAGGGGGCCCTCATC 
GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 
GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 
TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 
TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACC2MGTCCACCTCACCTAGAG 
CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 
TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 
TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 
ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 
TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 
GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 

AAAAAAAAAAAAAAAAA 
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MGAAISQGALIAIVCNGLVGFLLLLLWVILCWACHSRLPTLTLSLNPVPTPALAPVLRRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 
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FIGURE 110 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCA 
GTTCCTCCAAGCAAGTCATTTCeCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 
CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 
TCATQTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 
ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 
TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 
TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 
TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 
GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 
AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 
AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 
TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 
TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 
AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 
TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 
TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 
TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 
TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 
GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 
ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 
CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 
CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 
ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAG ATAQA TAC CC ATTT AGGTAT CTGT ACC T 
GGAAAAC^TTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 
AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 111 

M SGRDT I LGLC I LALALSLAMMFTFRF I TTLLVH IFISLV I LGLLF VCGVLWWLYYD YTNDL 
S I ELDTERENMKCVLGFA I VSTG I TAVLLVL I FVLRKR I KLTVE LFQ I TNKA I S S APFLLFQ 
PLWTFAI L I FFWVLWVAVLLSLGT AGAAQVMEGGQVE YKPLSG I RYMWSYHL I GL I WTSEF I 
LACQQMTIAGAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVM 
YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 
S KNS SHFTS I NCFGDF 1 1 FLGKVL WCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 
SFLSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRN 

EEGTELQAIVR 
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FIGURE 112 



gttcgattagctcctctgagaagaagagaaaaggttcttggacctctccctgtttcttcctt 
agaataatttgtatgggatttg tgatgcaggaaagc ct aagggaaaaagaatattcatt ctg 
tgtggtgaaaattttttgaa;.aaaaaattgccttcttcaaacaagggtgtcattctgatatt 

T ATGA GGACTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGG 
TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTG 
CCTCAGATCAACTGCGATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATG 
TCCAGCAGGATGCCAAGACCCCAAATACCATGTTTATGGCACTGACGTGTATG CATC CT ACT 
CCAGTGTGTGTGGCGCTGCCGTACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTT 
GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTT 
ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAA 
CCTACCCATCAGCT CTTACATACT CAT CATCGAAAAGTCCAGCTGCC CAAGCAGGTGAGAC C 
ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAGCCGGTCACTCTGATGCA 
GCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCACCACCTTGCCAAGGCCATCCCCTTCTG 
CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGGG CC AC AGGAGC CAGGAGATGGAT 
CTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCCAGAGCTGATCCAGGTATCCA 
AAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGCGGATGTCAGCCTGGGAC 
TTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCCTGGGAGATCCAAAC 
TGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAACGGCGATTCCG 
AATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGCCGGTCCAC 
TGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACACACACG 
AATTCT CGAGAT CTGAAGACAGCCATAGAGAAAATT AC TC AG AGAGGAGGACTTTCTAATGT 
AGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCA?VAGCCAATGGAAACAGAAGCG 
GGGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCT 
T CAAGACTTGCGAGAGAGTCAGGAAT CAACAT TTTCTT CATCACCATTGAAGGTGCTGCTGA 
AAATGAGAAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACG 
GCTTCTACTCGCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTG 
AAGCGGGTCTGCGACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACAT 
/ TGGCTTCGTCATCGACGGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGT 

TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTG 
CAGTACACCTACGAACAGCGGCTGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACAT 
CCTCAACGCCATCAAGAGGGTGGGCTACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCA 
ACTTCGCCCTGGAGCAGCTCTTCAAGAAGTCCAAGCCCAACAAGAGGAAGTTAATGATCCTC 
ATCACCGACGGGAGGTCCTACGACGACGTCCGGATCCCAGCCATGGCTGCCCATCTGAAGGG 
AGTGATCACCTATGCGATAGGCGTTGCCTGGGCTGCCCAAGAGGAGCTAGAAGTCATTGCCA 
CTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGAGTTTGACAACCTCCATCAGTATGTC 
C CCAGGATCATCCAGAACATTTGTACAGAGTTCAACTCACAGCCT CGGAACTGAATTCAGAG 
CAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTTGGACCACCCCACCGCTTAA 
TGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAACAAATGTCTTGTTATTA 
TTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGATGATCACAAACGT 
ATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATTTTACATTTTGACAATT 
GTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAGAGCTTTT 
GTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCATTTTT 
GTCATGACAATGTAGGAATTGCTGAATTAAATGTTTAGAAGGATGAAAAATAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 113 

MRTVVTjTMKAS VI EMFLVLL VTGVHSNKETAKK I KR P KFT VPQ I NCDVKAGK 1 1 D P E F I VKC 
PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 
SLPRWRESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 
LliAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 
RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 
IQKQLLADVAQALDIGPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 
GRA I S F VTKNFF SKANGNRS GAPNVVVVMVDGWPTDKVEEASRLARE S G I N I F F I T I EG AAE 
NEKQYWEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADI 
GFVIDGSSSVGTGNFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDI 
LNAI KRVGYWSGGTSTGAAINFALEQLFKKSKPNKRKLMI LITDGRSYDDVR I P AMAAHLKG 
VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 
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FIGURE 114 

CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 
TQCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 
GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCCACTACAGTTTTTCTCTGACTCTAAT 
TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 
TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 
CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCT C TC CAAGAAGGCTGGGGTGGAAGTAGA 
GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 
TCCCAGCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACTTCATGGCGTG 
AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 
CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 
GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 
AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 
GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 
CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 
GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 
AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 
GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 
CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 
CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 
TTGGAACAATCAAAGATCTGCGAGACC7VCAAGCTGGACAACCGCATGGAGTCGTTCTTCCTG 
GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 
GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 
TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCTGAAG 
GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 
GAAATTTCAGAAAAACACTGTTAGTTCGGGG CCATGGGAACCTCCAGCAAGG CCAGGAACAC 
TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 
CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 
AGACTCCTC ATAA CCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 



I 
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FIGURE HIS 

MP FRLL I PLGLLCALLPQHHGAPGPDGS AP DP AH YS FSLTL I DALDTLL I LGNVS E FQRWE 
VLQDSVDFDIDVNASVFETNIRVVGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 
LPAFQTPTGMP YGTVNLLHG VNPGET P VTCTAG I GT F I VE FATLS S LTGD P VFED VAR VALM 
RLWESRSDIGLVGNHIDVLTGKWVAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNK 
AI RNYTRFDDWYLWVQMYKGTVSM PVFQSLEAYWPGLQSL I GD I DNAMRTFLNYYTVWKQFG 
GLPEFTNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 
FATIKDLRDHKLDNRMESFFLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGY 
IFNTEAHPIDLAALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQKNTVSSGPWEPPARPGT 
LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 
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FIGURE I U 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 

ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 

GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 

AA£GCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 

ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 

TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 

AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 

GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 

ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 

GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 

TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 

CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 

GGGTATT C CAGTGCACCTAGAAACCATGGAG CCAGGGG CTG CATACT GTGTGAAGGC CCAGA 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 

GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 

TGGTCCTC CC AGAC AC CTTGAAAAT AACCAATT C ACC C CAG AAGTT AAT CAGCTG CAG AAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 

CTC ATAG GTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 

ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 

GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 

TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 

CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 

TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 

TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 

TGTTTCTGGAGAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 

GGCTTGGAGAGCCC ACTTTCCCAGAATAATCCTTGAGAGAAAAGGAAT CATGGGAG CAAT GG 

TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 

AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 

CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 

AAAAAAAAAA 
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FIGURE 117 

MQTFWVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
S I LKHPFNRNST I LTRPGME ITKDGFHLVI ELEDLGPQFEFLVAYWRREPGAEEHVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGFMLILV 
WPLFVWKMGRLLQYSCCPVWLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features : 
Signal peptide: 

amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites. 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 

Integrins alpha chain protein homology. 

amino acids 232-262 
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FIGURE 118 
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TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCGGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGG CTGAGATGACAGAATGCTTTATTTTGG AAAGAAACAATG TT CTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCJ^TATGACCCA^ 
ATACTCAATATGGACGAANTGCTATTGT CCAC CTGTTTGAGTGGCGCTGGGTTGAT 
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FUGURE 1119 

CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCATgGTGGCGGCGACGGTGGCAGCGGCGTG 
GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 
TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 
AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 
AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 
AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 
ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 
GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 
TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 
GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTATAACCACCGCGTCTCCTCCTCCACCA 
CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 
AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 
AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 
AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 
TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 
CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 
CAAATAGGAGGCATTCAATGAAC^TTTTTTGCATATAAACCAAAAAATAACTTGTTATCAAT 
AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 
GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 
AACAAT AC CT CACGAT ATAAAAT AAAAATG AAAGT AT C CT C CT C AAAAA 
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FIGURE 120 

MVAATVAAAWLLLWAAACAQQEQDF YDFKAVN I RGKLVSLEK YRGS VS L WN VASE CG FTDQ 
HYRALQQLQRDLGPHHFNVLAF PCNQFGQQEPDS NKE I ES FARRTYS VS F PM FSKI AVTGTG 
AHP AFKYLAQTSGKE PTWNFWKYLVAPDGKWGAWDPT VS VEEVRPQ I TAL VRKL I LLKREDL 
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PCT/US99/n252 



F1FCURE 121 

CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATSGCTGTCTACGTCGGGATGC 
TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 
CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 
CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 
ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 
TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 
AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 
CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 
TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 
GGCCCTTGTGTTCCCCAAGCAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 
GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 
ACAGTCATCTCGGTGGATGCCCCTTTGC CGGGGACCCTGCTCCTGGATGAAGTGGTGG CGGC 
TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 
ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 
CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 
ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 
GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 
AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 
GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 
GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 
ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 
CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGG 
CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 
ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGT 
GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 
AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 
CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 
CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 
TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 
GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 
GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 
GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 
TGAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 
CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 
TCCATCCCCCACATTCCCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 
GAAAAAAAAAAAAAAAA 
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FIGURE 122 



MAVYVGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRMVSTPIGGLSYVQ 
GC TKKHLNSKTVGQCLETTAQRVPEREALVVLHEDVRLTFAQLKEEVDKAASGLLS IGLCKG 
DRLGMWGPNSYAWVLMQI^TAQAGIILVSVNPAYQAMELEYVLKKVGCKALVFPKQFKTQQy 
YNVLKQICPEVENAQPGALKSQRLPDLTTVISVDAPLPGTLLLDEWAAGSTRQHLDQLQYN 
QQFLSCHDPINIQFTSGTTGSPKGATLSHYNIVNNSNILGERLKLHEKTPEQLRMILPNPLY 
HCLGSVAGTMMCLMYGATLI LASP I FNGKKALEAI SRERGTFLYGTPTMFVDILNQPDFS S Y 
DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 
R IMPHTEAR I MNMEAGTLAKLNTPGELC I RGYCVMLGYWGE PQKTEE AVDQDKWYWTGDVAT 
MNEQGFCKI VGRSKDMI IRGGENI YPAELEDFFHTHPKVQEVQWGVKDDRMGEEI CACIRL 
KDGEETTVEE IKAFCKGKISHFKI PKYIVFVTNYPLTI SGKIQKFKLREQMERHLNL 
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FIGURE 123 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 
ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 
CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
CCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAATATGAAGGACCTGGTGGT 
TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGT 
CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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FIGURE 124 

GAGCAGGACGGAGCCATOGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 
TG CAGAAAGCAGATGACGGATGCT C CC CGAAC AAGATG AAG A CAGTGAAGTG CGCG C CGGG C 
GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 
AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 
TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 
ACCT CG CGGG CG CT CGACCCGGCAGGTAATGAGAGTGCATACC CGCCCAACGGCGTGGAGTG 
CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 
GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 
GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 
TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 
CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 
CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 
AC CCACATCCACCACCAAACC CATGC CAG CGC CAAC CAGT CAGACT CCGAGACAGGGAGT AG 
AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 
CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGG C C C C AG CAGC CC CATAATAAAGGCTG 
TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTG1 
^GCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 
CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 
CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 
ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 
TCCGCTTGTC CT CTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 
AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 
GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGG 
GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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FIGURE 1125 

MDPARKAGAQAMIVfTAGWLLLLLLRGGAQALECYSCVQKADDGCSPNKMKTVKCAPGVDVCT 
EAVGAVETIHGQFSLAVRGCGSGLPGKNDRGLDLHGLLAFIQLQQCAQDRCNAKLNLTSRAL 
DPAGNESAYPPNGVECYSCVGLSREACQGTSPPWSCYNASDHVYKGCFDGNVTLTAANVTV 
SLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 
VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 
QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGOGGACCGGCTGTGCAGACGCC&ISGAGT 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCivGCTGAAAAG 
GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 
CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 
TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 
AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTG CAGCCATCAGGTGGAAGC CTCTGGAA 
C CTGAGGCGGCTGCTTGAACCTTTGG ATGC AAATGT CGATGCTJft&GAAAAC CGGC CACTTC 
AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTA 
ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 
GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 
GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 
CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 
CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 
GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCAC 
TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 
ACCTGATCCCAGTCTGAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCC 
CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 
ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 
ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 
GCGGTGAGCGAGGTGGGTTGGAGACT CAGC AGGCTC CGTGCAG C C CTTGGGAACAGTGAGAG 
GTTGAAGGTCATAACGAGAGTGGGAACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTT 
CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 
CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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FIGURE 127 

MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCK 
CSFNQKPRAPGDEEAQVENLITANA1 ; EPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATOAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 
AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 
CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGI 
SftCCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 129 

MKIPVLPAWLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFESIKRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 13(0) 

CAGTTCTGAAAT CAATGGAGTTAATTTAGGGAATAC AAAC CAGCCATGGGGGTGGAGATTG C 
CTTTGCCTCAGTGATTCTCACCTGCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 
TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 
TTTGCCGGCCACTCAT^GAGTGTITTTGTGTAAAGTATTTTTTAGAATACTGTTGACTTCT 
TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 
AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 
AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 
TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGACCTCTAGGAGAATATTT 
TATCCCTGGGTG CC CCTG ACACATTT ATGTAGTGAT C C CA CAAATGTGATTGTTAATTTAAA 
TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 
AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 
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FIGURE 131 

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 132 

GGGGAATCTGCAGTAGGTCTGCCGGCGASGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 
GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 
AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 
TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 
GAAGATGATGGCAGAGGT AGTC AG ACGGAAGCTAGGGACC CACT AT CAGATC ACTAAGAACA 
GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 
TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 
GGTT CCTAAATGGATGGAG C CTGC CAT C C CAGTCTT CT CCTTCAGTAAGACATCAGAGTACC 
ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 
CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 
GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 
ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 
CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 
GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 
AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 
TATCCACAG CTG AAGC CATGGGTT CACTATAT CC CAGT CAAAACAGAT CT CTC C AATGTCCA 
AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 
GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 
AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 
TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 
AC AGATCT CAGATATCCTACGGTGAGAAG CTT AC CATAAGCTTGGCTCCTATAC CTTG AATA 
TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 
AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 
CC AGAAAT CGTGAGATGTGGATTTTGAACC CAACTCTAC CTTTCATTT TC TT AAGACCAAT C 
ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 
TGTGATGATGCCCITTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 
CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 
AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 
TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 
GAGG TTTC TATAATG C CACATAGAAAGAGG CCAATTGCATGAGTAATTATTGCAATTGGATT 
TC^GGTTCCCTITTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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133 



MEwWASSPjLiKLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIE 
EDLTPFRGGISRKMMAEWRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 
MEMV 1 NVRDYPQVPKWME PA I PVFSFSKTSEYHD IMYP AWTFWEGGPAVWP I YPTGLGRWDL 
FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 
LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 
YIPVKTDLSWQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQ I I P KMLKTEL 
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FIGURE 134 

CACCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 
TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 
GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 
CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 
ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 
AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 
CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 
TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 
GACTATGCTGAGCTCATGGGCCTCAAACAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 
TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 
TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 
TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 
CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 
GAGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 
ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 
CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 
CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 
TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 
CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 
GGCTTCCACTCOVAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 
GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 
GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 
CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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FIGURE OS 

MAPALLblPAALASFIIiAFGTGVEFVRFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP 
LAWDLGLLLLFVGQHSLMAAERVKAWTS RY FG VLQRS L YVACT ALALQ L VMR YWE P I P KG P V 
LWEARAEPWATWVPLLCFVLHVISWLLIFSILLVFDYAELMGLKQVYYHVLGLGEPLALKSP 
RALRLFSHl^PVCVELLTVLWWPTLGTDRLLLAFLLTLYLGLAHGLDQQDLRYLRAQLQR 
KLHLLSRPQDGEAE 
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FIGURE 136 
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CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 
AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 
TT T AATT AAGC ATGGAAT AC AG AAAACAA CAAAAAA CT T AAG CT TT AATT T CAT CT G G AATT 
CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 
TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 
AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 
CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 
TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 
ACAGACAAGACTTTCACTTCAC ACTT CGAGAG CATT CAAACT GCT CTCAT CAAAATCCATTT 
CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 
TTCACAGGTTATC CTCTAATTGATAATTATT CCTATAGAGGATTTTAC CAAAAAAC CC AT AT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATT CATATT CC AGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 
CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 
ACATGCC^TTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 
GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 
GAAATT AATAGGACCAAACAATTTGGACATGT CATT CTGT AGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGT CATTTATAAAGTACTTCAAGATGTTGCAGT ATTT CACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATT CTTTACATG CAAACATTTTCCAGTTACTTAACTGATCAGTTT ATTATTGATA CATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 137 

MASALWTVLFSRMSLkSLKw^ 

FHFTLR^HSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 
EDKMLALSLEDEHLLYGD 1 1 RQDFLDTYNNLTLKT I MAFRWVTE FC PNAK YVMKTDTDVF I N 
TGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 
LVPRI YEMMGHVKP I KFEDVYVGI CLNLLKVN IH I PEDTNLFFLYRIHLDVCQLRRVI AAHG 
FSSKE I I TFWQVMLRNTTCHY 
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FIGURE 138 

CCTCTGTCCACTGCTTTCGTGAAGACAAG&SQAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAG CTAACTATAATAT CAACGTCAATGATG ACAACAAC AA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACAT TG CAAACATGTGTCGTGGGATT CCAACAT ACATGG CTGAGGAGATGCAAGAGG C 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACSftAACAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTT CTCTACTAGTTATGTTTGATTTCTTTAAGTTT CAATAAAAT CATTTAGCATTGAAAAAAA 
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FUGUIRE 139 

MK.KT 1 V t'AGijijUVFiiAPALiAJN Y W1NVNDDNNN AGSGQQSVS VNNEHNVANV^ 

I WD YGNGFAATRLFQ KKTC I VHKMNKF VMPS I QS LDALVKEKKLQGKG PGGP P PKGLMYSVN 

PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 



WO 99/63088 . PCT/US99/122S2 

FIGURE 1410 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 
TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 
CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 
TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 
GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 
CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATgAATGACAGTCTTCGAACCAATGTGT 
TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 
CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 
CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 
TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAG CCAAATTAAAAG CAAAGGG A 
TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 
AT CATC AC CAAG AGAAGTAAAAGCTGAAGAGAAATCAC CAAT CT C CAT TAATGTGAAGACAG 
TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 
GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTCAAGAACACGATCACG 
TTCTAGATCACATACTCC AAGAAGACACTATAATAATAGG CGGAGTCG AT CTGGAACATACA 
GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 
GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 
TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 
CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 
GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 
C TGA CTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 
ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 
GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 
AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTAOTTTATTATAGT 
TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 
ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 
ACTTTCTTAACACTGGTTTGT CTG CATGTGTAAAGATTTT TACAAGG AAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 

MNDSLRTNVFVRFQPETIACACIYLAARALQIPLPTRPHWFLLFGTTEEEIQEICIETLRLY 
TRKKPNYELLEKEVEKRKVALQEAKLKAKGLNPDGTPALSTLGGFSE'ASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
S KSRDHSDAAKKHRHERGHHRDRRERS RS FERSHKS KHHGGSRS GHGRHRR 
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FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTAT TAATGG CTTAAAGATTAT TGGAAGGGGT TT AT CA 
TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 
TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 
CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 
AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 
ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 
AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 
GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 
GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 
AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 
GAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCNAATTAAAAGCAAAGGGATT 
GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
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FIGURE 143 

GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCilfiGC 
GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 
CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 
TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 
GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 
AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 
TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 
GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 
CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 
TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 
GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 
CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 
GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 
CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 
ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 
TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 
CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 
ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 
CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 
GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 
TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 
AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 
GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 
GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 
GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 
CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 
GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 
GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 
TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
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FIGURE 144 

MAVLGVQLVVTLLTATLMHRLA 

RWANGL^EEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSC^YYLFTEAYY 
YMLGPAKETNI AVFWCLLTVTFS I KMFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQV 
VREETLELGLEPGLASMTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGLR 
LAQTHRDALTMSEDRPMLQFLLHTSFLS PLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 
FD SGRLWLLWLCLLRLAVTRPHLQAYL CLAKAR VEQLRREAGR I EARE I QQRWRVYCYVT 
WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 
AGALGGLLTPLFLRGVLAYL I WWTAACQLLASLFGLY FHQHLAGS 
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FICIJRE US 

CG7TNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 
CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 
TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 
CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCC 
GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 
TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 
CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 
GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 
GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 
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FIGUIRE 146 

GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 

TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 

ATGG CAAGGTTTGCTTAAAGGAGCTTGG CTGGTTTGGG CC CTTGTAGCTGACAGAAGGTGGC 

CAGGGAGAATGCAGCACACTGCTCGGAGAATSAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 

CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 

TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 

CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACC 

ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 

CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 

TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 

CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGA 

AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 

CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGC CTCTCTATTAGG CTGGTGGGAGGT 

AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 

AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 

TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 

ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCC C CGG ATGC CTACAGAC CC C G 

AGATGACAGCTTTCATGTGATTCTCAACAAAAGTAG CC C CG AGG AG CAGCTTGGAATAAAAC 

TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 

TATCGACATGGTCAGCTTGAGGAGAATGAC CGTGTGT T AG CC ATCAATGGACATGATCTTCG 

ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 

TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 

AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 

AATTACTT GT CATGAGAAGGTGGTAAATATCCAAAAAG AC C C CGG TGAAT CTCTCGGCATGA 

CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 

CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 

TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 

CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 

CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 

CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 

CAG CTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAAC AAAC CT 

TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 

TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 

GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 

TT ATAQA ATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 

ACACTATATTTATCTTGTCAGTTTTTAT AT TTAAAG AAAGAATACATTGT AAAAATGT CAGG 

AAAAGTATGATCATCTAATGAAAGC CAGTT AC ACOT CAGAAAATATGATTCCAAAAAAAT TA 

AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 

TTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACT 

CAAG CTGATTT AAATTTAAAATTTGGTATATG CTGAAG T C TG C C AAGGGT ACATTATGGC CA 

TTTTTAATTTACAGCTAAAATATTTTTTAAAATC 

ACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 147 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 
APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 
FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 
SESLS I RLVGGS ET PL VH 1 1 IQHI YRDGVI ARDGRLLPGDI I LKVNGMD I SNVPHNYAVRLL 
RQPCQVLWLTVMREQKFRSRNNGQ APDAYRPRDDS FHV I LNKSS PEEQLG I KLVRKVDEPGV 
FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRS 
PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHRE 
WDLP I YVI SVEPGGVI SRDGRI KTGD I LLNVDGVELTEVSRSEAVALLKRTSSS IVLKALEV 
KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSWPGTFL 
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FIGURE 148 

C CAAAGTGATCATTTGAAAAAGAGATAT CC ACAT CTTCAAGCCCAT ATAAAGGATAGAAGCT 
GGACAGGGCAGGTTTACTTACTCCAGCACCTTCCTOTCCCAGGCAAATOGTGCTGACCATCT 
TTGGGATACAATCTCATGGATACGAGGTTTTT AACATC AT CAGC CCAAGCAACAATGGTGGC 
AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 
AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 
TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 
CTCCAATGGTACATCTATGAGAAACAGG CT CTGGAC AACATGTTCTCCAACAAATACACCTG 
GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 
CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 
AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 
TG CAGACATT CATGTTTAGGATGATTAGCC CTCTTGTTTTATCTTTTC AAAGAAAT AC AT C C 
TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
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FIGURE 149 

MKILVAFLWLTIFGIQSHGYEVFNIISPSNNGGNVQETVTIDNEKNTAIVNIHAGSCSSTT 
I FDYKHGYT ASRVLSRRACF I LKMDHQN I PPLNNLQWY I YEKQALDNMFSNKYTWVKYNPLE 
SL I KDVDWFLLGS P IEKLCKHI PLYKGEWENTHNVGAGGCAKAGLLG I LGI S I CAD IHV 
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FIGURE ISO 

GGCACGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 
MSQGGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AG C C AG ACAT CGGACTGGTTCTGGTGCCAGGCTGC AAACAACGC CAATGT CCAGCACAGCGC 
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGG ACGCAAAG CAG CAG C CATG TAQA ATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTC^TGO\AAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEEITPWSIAYKVLEVFPKGRWVLITCCAPQPPPPITYSL 
CGTKNIKVAKKWKTHEPASFNLNVTL1 ? lSSPDLLTYFCRASSTSGAHVDSARLQMHWELWSK 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSALTWPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG 
G FR I GNGE VRGRKAAAM 
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FTCGUME 152 

GGTCCTT AATGG CAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 
CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 
CATCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 
CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTGGGGAAGAAA 
CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 
TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCTGC 
AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 
TTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC 
TGGAGC CAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCC ATGTC CTT C CAT T 
ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAG CAC C 
CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 
AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 
TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 
GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 
GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 
TTGCC^CAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 
TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 
TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 
GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 
AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 
TGTT CTGT ACTGATATT T AAAT AAAGAGTT CT ATT T C C C AAAAAAAAAAAAAAAAAA 
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F I G U RE 1 5 3 

InAAAAATKILLCLPLubJ^ 

HYDCGNKTVTPVSPJ^KKLNVTTAWKAQNPVLREWDILTEQLRDIQLENYTPKEPLTLQAR 
M3CEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKEKWENDKVVAMSFHYFS 
MGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 
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FIGURE 154 

GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 
CCTTGGGCCTCACAATTTTCATTCTGTTTTCTGACTTTCAAGTTATATAGCGTGG 
TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAG CC CTCACCCAATTCTG 
GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTGATCAAAA 
GAGAATTTGGATTCTACTCTAAAAGT CAATATAGGACT TGG CAAAAGAAGCTAGCAGAAGAC 
TCAACCTGGCCT CCCATAAACAGGACAGATTATTCAGGTG ATGG CAAAAATGGATTCTACAT 
CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 
CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC 
ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 
CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 
TT AG ATAGG C TT ATTCTGACTGTATG CTTTACCAATGAGAGAAAAAAATGCATTTC CTGTAT 
CATCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
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FIQEJMK ISi 



MELIPTITSWRVLILWALTQFV T CGFLCRGFHLQNHELWLLIKREFGFYSKSQYRTWQKKLA 
EDSTWPP INRTDYSGDGKNGFYINGGYESHEQ I PKRKLKLGGQPTEQHFWARL 
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FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATQAACGTGCCACACC ATGA AG 
CTCTTGTGGC'AGGTAACTGTGCACCACCACACCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTACCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGAC CACGGAGATCT CGC CTGAGGACACAACG CGAAAGTACAAGCCTGTTC CTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAAT CCAGGTGGACGAAGACATCCCAGCAG CAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACC ATATTAACTACAACACCT ACAAAC CAGCACATGGGG CC CACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACC AAGGAC AAGGTACAGGAAACTCAAAT ATGA CTCCCCTC C C C CAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 
TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
CAAAAAGTCAAAACA 
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FIGURE 157 

MKLLWQVTVHHHTWNAILLPFVYLTAQVWILCAAIAAAASAGPQNCPSVCSCSNQFSKVVCT 

RRGLSEVPQG I PSNTRYLNLMENN I QM I QADTFRHLHHLE VLQLGRNS I RQ I EVGAFNGLAS 

LNTLELFDNWLTVIPSGAFEYLSKLRELWLRNNPIESIPSYAFNRVPSLMRLDLGELKKLEY 

ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 

VmSQVSLIERNAFDGLASLVEIJ^LAHNl^SSLPHDLFTPLRYLVELHLHHNPWNCDCDILW 

liAWWLREYIPTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAEL 

KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNTXSTLNFSHVLLSDTGVY^ 

ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPAYTTSTTVLIQ 

TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLI VFYKLRKRHQQRS 

TVTAARTVEIIQVDEDIPAATSAAATAAPSGVSGEGAVVLPTIHDHINYNTYKPAHGAHWTE 

NS LGNS LH P TVTT I S E P Y I IQTHTKDKVQETQ I 
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FIGURE H§g 

CGCTCGGGCACCAGCCGCGGCAAGGATSGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 

TTCTTCAG CTCCTTCTCATCTCGTCCTTGC vZAAGAGAGTACACAGT CATTAATGAAGCCTGC 

CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 

CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 

AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 

AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 

CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 

GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGCTAAA 

CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 

GTATGACTATGTTGAGGTTCGTGATGGAGACAAC CGCG ATGGCCAGATCATCAAGCGTGT CT 

GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACTCCACGTCCTCTTCCAC 

TC CGATGGCT CC AAGAATTTTGACGGTTT C CATG C C ATTTATGAGGAGATCACAGCATGCT C 

CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 

CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 

GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 

ACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 

GCAATGAGAAAAGAACTTGC CAGCAGAATGGAGAGTGGT C AGGGAAACAGCC CATCTGCATA 

AAAG CCTGCCGAGAACCAAAGATTTCAGAC CTGGTGAGAAGGAG AGTT CTTC CGATGCAGGT 

TC AGTCAAGGGAGACACCATTACAC C AGCT ATACTCAG CGGC CTT CAG CAAGCAGAAACTGC 

AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 

CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 

GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGG CGGGCACC AT CCTG CATC CCTAT CTGCG 

GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC 

ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGCGTGGTTCCTAGT 

CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 

TGGGGAAGGT CACCATGATC AAGACAGCAGAC CTGAAAGTTGTTTTGGGGAAATTCTAC CGG 

GATGATGACCGGGATGAGAAGACCAT CC AGAG CCTACAGATTTCTGCTATCATTCT GCAT C C 

CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 

GTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTC 

CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 

CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 

AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 

CCCACTGCCCCITCTGATATCTGa^CTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCC 

GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 

AAACATGCAGCCACAGGCTCTCCACTGCCTTCACC^AGGTGCTGCCTTTTAAAGACTGGATT 

GAAAGAAATATGAA ATG^ CC^TGCTC^TGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 

TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 

CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 

TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 

AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 

TC C CCAACTTTCAGTTATACGAATGCCATCAGCTTGAC CAGGGAAGATCTGGG CTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 

TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 

ATCTCTTGTAC^CATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

MELGCWTQLGLTFLQLLIiI SSLPRE YTV I NEACPGAE WN I MCKE CCE X uy 1 E C V CPGKRE V V 
GYTIPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
CMRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRD 
GDNRDGQI IKRVCGNERPAP IQS IGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDG 
TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
VSFFCNNSYVLSGNEKRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 
WSGRAPSCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RT VWAAHCVTDLGKVTM I KTADLKWLGKFYRDDDRDE KT IQSLQI S AI I LHPNYD P I LLD 
ADIAILKLLDKARISTRVQPICLAASRDLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSG 
WSWDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERNMK 
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FIGURE 160 

AC CAGG CATTGTATCTTCAGTTGTCAT CAAGTTCGCAAT CAG ATTGGAAAAG CT CAACTTGA 
AGCTTTCTTGCCTGCAGTGAAGCAGAGAGATAGAT^ 

TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 

GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 

CAAAGGAGTT CATGGCTAATTT C CATAAGAC C CT CATTTTGGGGAAGGGAAAAACT CTGACT 

AATG AAGCAT C C ACGAAGAAGGTAGAACTTGACAACTGT C C TT CTGTGT CT C CTTACCT CAG 

AGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCACTTTGGAAGAGGTACAGGCAGAAAATC 

CCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAGCrTTACAGAGGGT^ 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTTCCT 

GCAGAGG CAG CAGCTGGATT ATGG CATCTACGT CAT C C ACCAGGCTGAAGGT AAAAAGTTT A 

ATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGACTGC 

TTTATATTCC ACGATGTGGACC TGGTACCCGAGAATGACTTTAACC TTTACAAGTGTGAGGA 

GCATCCCAAGCATCTGGTGGTTGGCAGGAAC^GCACTGGGTACAGGTTACGTTACAGTGGAT 

ATTTTGGGGGTGTTACTGCCCTAAGCAGAGXGCAGTTTTTCAAGGTGAATGGATTCTCTAAC 

AAOTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAAT 

GAAAATTTCCCGGCCCCTGCCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACA 

AAGGCAATGAGGTGAACGCAGAACGGATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGA 

ACAGATGGGTTGAGTAGTTGTTCTTATAAATTAGTATCTGTGGAACACAATCCTTTATATAT 

CAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGGAAG 

AACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAAATAGTAGCACACATTAAGA 

ACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCTTAGCAGAGCTCCT 

GGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTC TT AG TCATTTTGATCATGAGG 

GTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAAA 

ATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTAT 

GGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGT 

CCAAGGTAGAAAGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTG 

AAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGA 

CAC71GTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAG 

TAGCTGAGCTGGTTGCAGGTGCTGATAGCCTTCAGGGGAGGACCTGCCCAGGTATGCCTTCC 

AGTGATGCCCACCAGAGAATACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGATTT 

TGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATATGT 

CTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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FIGURE 1161 

LTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRV 
A I LVPHRNREKHLMYLLEHLHPFLQRQQLD YG I YVI HQ AEGKKFNRAKLLNVGYLEALKEEN 
WDCFIFHDVDLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNG 
FSNNYWGWGGEDDDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSR 
VWRTDGLS S CS YKLVS VEHNPL Y INI TVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites: 

amino acids 4-7, 220-223 and 335-338 

Xylose isomer ase proteins: 

amino acids 191-201 
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MGTOE 162 

CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 
GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCAATO 
GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 
CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 
CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 
CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 
GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 
TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 
GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 
CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 
GCAACTT C CT CAAG AC CGCCTTGTTTCT CTAC AGCTGGGACTTCGGGGACGGGAC CCAG ATG 
GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 
CAAAGTGGTGGCGGAGTGGGAAGAGGTGG AGCCGGATGC CACGAGGGCTGT GAAG CAGAAGA 
CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 
CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 
TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 
CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 
TG CTTCAGCATC CGGGC CGAGAAT ATCATCAG CAAGACACATCAGTACCACAAGATC CAG GT 
GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 
TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 
AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 
GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 
ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 
CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 
CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 
TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 
TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 
CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 
CTGTGC CAGAGAG CTAGAAAGAAGGT CATAAAGGGTTAAAAATCCAT AACT AAAGGTTGTAC 
ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 
CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 
TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 
AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 
GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 
AATACTGAG TGATTGCAGAGTGCTTTATAAATATCAC CTTATTTTAT CGAAAC C CATCTGTG 
AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 
TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 
GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 
AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 
GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
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FIGURE 163 

I^QAVWSRLGRILWLACLLPWAPAGVAAGLYELNLTTDSPATTGAVVTISASLVAKDNGSLA 
LPMAHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVWTAADCWMCQPVARGF 
WLPITEFLVGDLWTQNTSIiPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
MVTEDSWYYNYS I IGTFTVKLKWAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL 
GPTLIQTFQKMTVTLNFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YC FS I RAEN IIS KTHQ YHK I Q VWP S R I Q P AVF A F P C ATL I TVMLAF I MYMTLRNATQQ KDMV 
ENPEPPSGVRCCCQMCCGPFLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domain : 

amino acids 339-362 



N-glycosylation sites. 

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
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FIGURE 164 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATOGCACTGAGCTCCCAGATCT 

GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 

CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 

GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 

GCGGCTGCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTASAACCTACCTGCCCTG 

CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 

TGGCTGGTTCTTTTGTTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 165 

MALS £ Q I V7AACLLLLLI-LAS LTS C £ Y F T QQTG Q LAE LQ P Q C RAG ARA3 m-1 PM RQ RRRRRDT K 
FP I C I FCCGCCHRS KCGMCCKT 
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PCT/US99/12252 



FIGURE 166 

CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 
CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 
ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 
AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 
CCTTGAGAATGGAGCGAGGAGC CAAGGAGAAGAACCACCAGCTTTACAAG CCCTACACCAAC 
GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGG 
TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 
GGAAAGGAATGGAGACCATTATGGATGATGAGGTGAC AAAGAGATT CT CAGCAGAAG AACTG 
GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 
GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 
TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 
GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 
AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 
GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 
ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 
CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 
CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTGCATC 
AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 
TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 
TGGTGACGTACCTGCTGCGAATGATGACCAGCTGGGCCATTGTCTGCAGCGTGTGGTACCTG 
CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 
CATTGCCAGGCAGGGAGGACTTGTGGACCTGCTGTGGGATGGGGGCCTGAAGAGGGAGAAGG 
TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAACCAC 
AAGGACAGGAGCCGCTCCTSAGCCTGCCTCCAGCTGGCTGGGGCCACCGTGCGGGGTGCCAA 
CGGGCTCAGAGCTGGAGTTGCCGCCGCCGCCCCCACTGCTGTGTCCTTTCCAGACTCCAGGG 
CTCCCCGGGCTGCTCTGGATCCCAGGACTCCGGCTTTCGCCGAGCCGCAGCGGGATCCCTGT 
GCACCCGGCGCAGCCTACCCTTGGTGGTCTAAACGGATGCTGCTGGGTGTTGCGACCCAGGA 
CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGGAGGAATGCCATTAAAGTGAACTCCCCA 
CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCTAGAGAT 
GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCAGGGATGTGCTGGGGGCGGCCACCCG 
CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCTCAAGATCGCCCCCAGC 
CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCTGCAGGGGCTTTCAGCAAAATG 
AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGGGAGGGGAAGAGGCCAGGTG 
GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTACTCCAGGCTAACCCTGAACTCCCC 
ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCGGTTTCCCCATCTGTAATATGAGTCGGGGG 
GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATTAAAGTGCTGCGGGTGAGTGA 
AGGACACATCACGTTCAGTGTTTCAAGTACAGGCCCACAAAACGGGGCACGGCAGGCCTGAG 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTGCT^ 
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FIGURE 167 

MFLLLPFDSLIVNbLCiiyijTVLFTijLLVFI I VPAIFGVSFGIRKLYMKSLLKIFAWATLRME 
RGAKEKNHQLYKPYTNGI IAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLLPLRIALAFTG 
I SLLWGTTWG YL PNGRFKEFMS KHVHLMCYRI CVRALTA I 1 TYHDRENRPRNGG I CVANH 
TSPIDVIIIASDGYYAMVGQVHGGLMGVIQRAMVKACPHWFERSEVKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVMMFKKGSFEIGATVYPVAIKYDPQFGDAFWNSSKYGMVTYL 
LRMMTSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 

GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 
CGTCTCCTCCAGGGASSGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 
ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCCCCTGGGCCTGGCTCCAGACACCTTTGA 
CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 
AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 
AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 
CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 
GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 
AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 
TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 
GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 
GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 
ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 
CAGCCTTGAGAAGCAAGAACATGGTT C CGGAC CC AG CCCT AG CAG C CTTCTCCC CAAC CAGG 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 
TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 
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FIGURE 169 

MAUUUjM I ALGSLGLHTWQ AQ AVP 

ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSKTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRFGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEFQLSGVGP 
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FIGURE 170 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT&TgGCTGGTTCCCCAACATGCCTCA 
CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 
GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 
TATTGTCTGGACCTT CAACACAAC CC CT CTTGTC ACCATACAGC CAGAAGGGGG CACTAT C A 
TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 
CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 
CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 
TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 
GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 
CCATAATGGGTC CATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGAC CTTC AT CT 
GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 
GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 
CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 
AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCGCCAT 
TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 
TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 
TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCI&SACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 3171 

MAGS PTCLTL I Y I LWQLTGSAASGPVKELVGS VGGAVTFPLKSKVKQVDS I VWT FNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 
YEHLSKPKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRW 
GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLW 
FLKRERQEE Y I EEKKRVD I CRETPN I CPHSGENTE YDT I PHTNRT I LKEDP ANTVYSTVE I P 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURE 172 

CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 
TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 
CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 
AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 
GATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGT 
GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 
AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 
AC CAATCTGACATG CTGCATGGAACATGGGGAAGAGGATGTGATTT AT AC CTGG AAGGCC CT 
GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 
AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 
ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 
TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 
ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 
TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 
AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 
AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 



1 
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FIGIME 173 

GAAAGACGTGGT CCTGACAGACAGACAATCCTATTC CCTACC AAA&I2AAGATG CTGCTG CT 
GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 
ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 
GAAAAGATAGAAGAACATGGCAACTTTAGACTTTTT CTGGAGCAAATC CATGTCTTGGAGAA 
TTCCTTAGTTCTTAAAGT CCAT ACTGTAAGAGATGAAGAG TG CT C CGAATTATCTATGGTTG 
CTGACAAAACAGAAAAGG CT GGTGAATATT CTGTG ACGTATGATGGATT CAATACATTTACT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 
AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 
GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 
GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 
ACTTCTCACCAGGACTCCACCATCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 
TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGCACTTAA 
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FIGURE 174 

MKMLLLLCLGLTLVCVHAEE AS STGRNFNVEK INGE WHT 1 1 LASDKREK I EEHGNFRL FLEQ 
IHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRCLQARE 
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FIGURE 17f 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACC&TOACCTGCTGCGAAGGATGGACATCCTGCAA 
TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 
TCAGCTTAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 
CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAG 
AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCA 
CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 
ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 
CATTCATCCAGAATCCTTCAACTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTG 
GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCGACTTCTCAGTATTTTTAGGTCTATTGCT 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGIASTTTAATGGGAATAAAATGTAAGTA 
TCAGTAGTTTGAAAAAAAAAAA 
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FTGURE 176 

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMA 
I PATTMSLTARKRACCNNRTGMFLSSFFSVITVI GALYCMLI S IQALLKGPLMCNSPSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRL 
IHFSVFLGLLLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
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FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 
GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCTTATTCTTAAGTGACGCTGCGGTAAACCTC 
C AAGTTGCC AAACTT AAT C C AC CT C C AG AAGCTCTTG C AG C C AAGTTGG AAG TG AAG C AC TG 
CACCGATCAGATATCTTTTAAGAAACGACTCTCATTGAAAAAGTCCTGGTGGAAATAGTGAA 
AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTCAACGACACC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 
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FIGURE 178 



MRLSVCLLMVSLALCCYQAHALVCPAVASEITVFLFLSDAAVNLQVAKLNPPPEAliAAKLEV 
KHCTDQISFKKRLSLKKSWWK 
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FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGCaGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGAC^TGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 
CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATGCAAATACAAGAGCAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCAC 
TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 
AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
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— A83/1AO 

FIGURE 180 

MERVTIJ^LLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLiAIAGIAAVLSGKCK 
YKSSQKQHSPVPEKAIPLITPGSATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGilSTCGCTGCTGAGCCTGCCC 
TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 
CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 
AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 
ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 
ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 
CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 
CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 
GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 
TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 
CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 
TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 
GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 
TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 
CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 
ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 
ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 
CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 
AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 
TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 
ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 
TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 
CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 
CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 
CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 
TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 
GTGACTTTCTGACCCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTC 
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MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWG 

HLGLITPTEEGLKDSTQMSATYSQGPTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDKLF 

IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGS 

SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYY 

LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 

KALSDED I RAEADTFMFGGHDTTASGLS WVLYNLARHPEYQERCRQEVQELLKDRDPKE I EW 

DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVW 

PDPEVYDPFRFDPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEMKWLALMLLHFR^ 

EPRRKLEL I MRAEGGLWLR VE PLNVGI/Q 
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FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGATCaifiTATAAGCTGGCCTCCTGC 
TGTTTGCTTTTCACAGGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAGGGA 
AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 
GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 
AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 
CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 
AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCSSAAGTGAAATAAGCATCTGT 
TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 
AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 
ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 



MYKLASCCLLFTGFLNPLLSLPLLDSREISFQLSAPHEDARLTPEELERASLLQILPEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWKYCV 



WO 99/63088 „ / PCT/US99/12252 

FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGCC ATG CCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAGTGATCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLLGMLWLDLjAMAGSSFLSPEHQRVQQRKESKKPPAKLQPRAIiAG 
* GQAEGAEDELEVRFNAP FDVG I KLSGVQYQQHSQALGKFLQD I LWE EAKEAPADKO " " 
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FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 
CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 

VVRAAHLTPLLEYVPNPEPPTP 
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FIGURE im 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACC&IfiGCCAAG 
ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGCATGCT 
ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTGGGCACACAGAAGG 
TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 
GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 
CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 
AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 
GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 
GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 
TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 
ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 
CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 
TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 
GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 
ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCATA5TAAGAGCTTCAAGGAAAAC 
CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 
CGTGGGTCCTTTGAC GAG CTACCACCAGTATCATAATCAGCC CATC CACTCTGTCTCTGAGG 
GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 
AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 
GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 
TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 
CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 
CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 
TAAACCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 
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FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGA?VAGAAAGGTCAGCTTTGGCCCAG ATG TGGTTACCCCTTGGTCTCCTG 
TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCAGGCCTGTCA 
GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCTTCAGAGCAAACAGGACAACCTATG 
TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 
GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 
AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 
GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 
CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 
AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 
GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 
CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 
ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 
GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 
GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 
TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 
ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 
AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 
AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 
CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNEliCFCDLYDRATSPPLKCSLL 
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FIGURE im 

GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 
GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 
ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGCCCCT 
CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 
ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 
GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 
TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 
TCITTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAAAAGAAATTACTGAACTATTG 
TCAAATGGACTTCCTdTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 
TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 
CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 
ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 
GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 
TGTAATCATTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 
TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAG 
TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 
GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 
CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 
ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 
ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 
TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 
ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 
ATACATGGAAC CACTACTGATGAGGG ACAGTTGT ATGTTTGCAT CATAT ATG C CAGAAAACC 
TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 
CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 
CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 
ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 
TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 
CTGTTTTGATl^TGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 
GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 
AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 
GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 
TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTT CGC CATC AGCCAAAACT 
CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 
TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 
TAAG CCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATAT CGGCGTGTGGCTGG 
AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 
GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 
CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 
TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 
GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 
GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGAATAT 
GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 
ATTTTAAATATCTATTTTAAAAAAAAAA 
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FIGURE 194 

^GIKALISLSFGGAIGLMFLMLGCALPIYNKYWPLFVLFFYILSPIPYCIARRLVDDTDAM 
SNACKELAIFLTTGIVVSAFGLP1VFARAHLIEWGACALVLTGNTVIFATILGFFLVFGSND 
DFSWQQW 
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FIGURE 19SA 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 
CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 
GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 
CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 
CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 
AACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 
CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 
ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 
GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 
GGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCATATAATTAGCCTGCACACAA 
AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCGCTTCCAA 
TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCT 
GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 
CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 
GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 
CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 
ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 
GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 
TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGCG 
CCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 
GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 
ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 
CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCA 
CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGA 
ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 
ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 
GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 
GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 
TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 
CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 
GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 
CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 
CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGG 
TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 
TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 
TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 
ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC 
CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 
CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 
TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 
AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 
TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 
GACAGAGAGAGAATAAT CGTGCTTTGTTTTATG CTACTC C TC CC AC CCTG CC CATGATTAAA 
CAT C ATGT ATGT AGAAGATC TTAAGTC CATACGCATTT CATGAAGAACCATTGGAAAGAGG A 
ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 
TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 
CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 
AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 
TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTTGCTCTGACCCCTC 
TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 
TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 
ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 
GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAAAAAGTCTT 
TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 
GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 
TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACT 
CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 
TCAACCATTTAAAAATATTCTTCTGAACTCATCAAAGTAGAAGAGCCCCCAACCTTTTCTCT 
CTGCCTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTATAT 
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FIGURE 19EM 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCCAAGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCT7ATTTTAGTCGACTTGTGAG 
CAAGTTTGATGCCTAGTCTATCTG ACATGGC CCAGTAGAACAGGGCATTG ATGGAT CACATG 
AGATGGTAGAAGGAACATCAT CACATACCC CT CT C AC AG AGAAAATT AT CAAAGAACCAGAA 
ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAAT 
TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 
TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGWLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFLDIGYNQLKSLARNS FAGLFKLTELHLEHNDLVKVNFAHFPRL I SLHSLCLRRNKVAI V 
VSSLDWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKWTGTMA 
LIFSFLIVVLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVI INEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGCAAGGAGCCGAGGCGAGaigGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GC AC TG AC CCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCT CGG CT CTGGGC CGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 
GATCTGCGCGGCCCTGCTCCAGCCCCT 
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FUGURK 198 

MGVLGRVLLWLQLCALTQAVSKLWVPNTDFDVAANWSQNRTPCAGGAVEFPADKMVSVLVQE 
GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFF PPS ASFRVGLGPGASP VRVRS I SALGRTFTRDEDLAVFLjASR 
AGRLRFHGPGALSVGPEDCADPSGCVCGNAEAQPWICAALLQP 
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FIGURE 199 

AT CG CATCAATTGGGAGTACCATCTTC CTCATQGGACC AG TGAAAC AG C T GAAGCG AATGTT 
TGAGCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTT 
CTGCCTTTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCA 
TTGACGTGGTACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGC 
CGTGTGTCTTGCASftATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAG 
CTGGTGGACAGTTTTGTAACTATCTT CGAAAC CTCTGTCTTACAGACATGTG CC TTTTATCT 
TGCAGCAATGTGTTGCTTGTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT 
TCTCGAACCTGAATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAA 
TCTTCCTCATGTACCTGTTTCCTCTCTGGATGTTGTCCCACTGAATTCCCATGAATACAAAC 
CTATTCAGCAACAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2M 

MGPVKQLKRMFEPTRLIATIWLLCFALTLCSAFVmHNKGLALIFCILQSLALTWYSLSFIP 
FARDAVKKCFAVCLA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 
ACTACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 
TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 
TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 
TATAAAGTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGT 
TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 
CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 
TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 
AAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATAC 
ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 
GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 
AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 
CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAAC 
ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 
AG AACTTG ACCAGTACCTGGATAAAGACCACTATAC CC TG ATTGAT CAAT CTC CAGTAGCAG 
CCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 
AATCTTACTGTTTACAAAAAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCG 
AATTCAACCAATCATAGCAGTGGCTGATGAAGGGTGGCACATTTTACAGAATAAGTCAGATG 
ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTA 
GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 
TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 
CT CC CTGGTAGTGTTAAACCAG CAGAATAT GACC AAGAGGGGTCATAC C CTTATTTCATAGG 
GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 
ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCC 
TA&TGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 
AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTAAGCAGTTACAT 
ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 
CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 
AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 
AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 
CTTTATGGACAG TTATGT CTT AT AACTTG ATTGAAAAT GACAACTTTTTG CACC CATGTCAC 
AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATT TCTAATAATC CCGAATAATGAACA 
TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAG GT GATAAGTGTTGAAAATTAAATGTG 
ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGCAT 
TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 
TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 
TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 
TGTAATTATTTGCTCCTTTGAAAATTAAATCACTATTAATTACATTAAAAATCAAATTGGAT 
AAAAAAAAAAAAAAAAAAA 
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FIGURE 202 

MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
QVTNVF ITKTYPNHYTLVTGLF AENHGI VANDMFD P I RNKSF SLDHMN I YDS KFWEEATP I W 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSPLMGPVISDIDKKLGYLIQMLKKAKLWNTLNLI ITSDHGMTQCSEER 
LIELDQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SRIQPIIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFIiAHGPAFRKNFSKEAMNSTD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRWPYTQSTILLPGSVKPAEYDQEGSYPYF 
I GVSLGS I IV I VFFVI F I KHL I HSQ I P ALQDMHAE I AQ PLLQA 
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FIGURE 203 

GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 
GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 
AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 
CCAGATCCAGAGGCAACAGGGACA2SGCCACCTGGGACGAAAAGGCAGTCACCCGCAGGGCC 
AAGGTGGCTCC CGCTGAGAGGATGAGCAAGTTCTTAAG GCACTTCACGGT CGTGGGAGACGA 
CTACCATGCCTGGAACATCAACTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 
AGCAGCCACCAC CCACAC CAGTCT CAGGCGAGGAAGG CAGAG CTGCAG CCCCTGACGTTGCC 
CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 
CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 
TTGCTGAG CT CATC CTGGACCTGAAGATCATCCAG CC CGACAAGAATAAC TATG CT GC CATG 
GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 
ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 
TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 
CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 
TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 
AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 
AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 
CACACAGCCACCGTGAAAGT CCTGGAGTAAAATG TG CTGT GT AC AG AAGAGAGAGAAGGAAG 
CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 
TCACTTCAGATTACAAATC^CACAGAGCATCTC 

AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 204 

MATWDEKAVTRRAKVAPAERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQPPPTPV 
SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQ VI 1 1 CL WLDALLVLAEL I LDL 
KIIQPDKNNYAAMVFHYMSITILVFFMMEIIFKLFVFRLSSFTTSLRSWMPVV\AA/SFILDI 
VLLFQEHQFEALGLLILLRLWRVARIINGI 1 1 SVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
CSEKPLD 
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FIGURE 2<0>g 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACM 
GCTCTGTCTGTGCCTGTACGTGCUGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTC CT CAAAATTG CC C C CGAATCAG CCATCAAATTCATGGC CTATGAGC AGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GG GG CC AT CG CC CAGAGCAG CAT CTACC CAATGGAGGTCCTGAAGAC CCGGATGG CGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG 
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTG CCAATAGTGAG CTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
C CCA CATC CCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGC CGGACAAATGAG CG AC TTCTGTGCTT CCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGC^GGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAAAGAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2(0)6 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILK 
SMDKNGTMTIDWNEWRDYHLLHPVENIPEI ILYWKHSTIFDVGENLTVPDEFTVEERQTGMW 
WRHL VAGGG AGA VS RT CT AP LD RLKVLMQVHA S R SNNMG I VGG F TQM I REGGARS L WRGNG I 
NVLK I APESAI KFMAYEQ I KRLYGSDQETLRI HERLVAGSLAGAIAQSS I YPMEVLKTRMAL 
RKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNS 
ADPGVFVLLACGTMSSTCGQIiASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFG 
LYRGLAPNFMKVI PAVSISYWYENLKITLGVQSR 
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FIGURE 2(0)7 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCC&I 
GG CTT CC CTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCAT CATTATTCTGG CTGGAG 
CAATTGCACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTC 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 
ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 
AAGAAGGCAAAGATGAGCTGTCGGAGCAGG ATGAAATGTT CAGAGGC CGGACAG CAGTGTTT 
GCTGATCAAGTGATAGTTGGCAATGC CTCTTTGCGG CTGAAAAACGTGCAACTCACAGATGC 
TGGCACCTACAAATGTTATATCATCACT TCTAAAGGCAAGGGGAAT G CTAACCTTGAGTATA 
AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 
CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 
GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 
TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 
AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATCGGAGATCAAAAGGCGGAG 
TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 
GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 
CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 
TTTTAT ATTTCTGGGAGGAAATGAATTCATAT CTAGAAGT CTGGAG TG AG CAAACAAGAGCA 
AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 
GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 
TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 
GGGGAGTGAGAGGACAGGATAGTGCATGTT CTTTGT CTCTGAATTTTTAGTTATATGTGCTG 
TAATGTTG CT CTGAGGAAGCCCCTGGAAAGTCTAT C CC AACATATCCACATCTTATATT C CA 
CAAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 
GGCGGCTG CATT TTAGTAATGGGT CAAATGATTCACTTTTTATGATGCTT C CAAAGGTGC CT 
TGGCTTCT CT TC CCAACTGACAAATG CCAAAGTTGAGAAAAATGAT CATAATTTTAGCATAA 
ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAA(^UVAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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CTGIME 2m 

MASLGQILFWSI ISI 1 1 ILAGAIALI IGFGISGRHS ITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVD 
QGANFSEVSNTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRR 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FIGURE 2(Q)$> 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTG3CCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGATTGAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTG CT CAC CGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGAC CCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTC CT CTCTTATGGGGATAACAGCTACCT CATGGATCACAATAAG AGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCT GC GGGGATTTAG CACAGGAGACTC TACG CT CA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAG AG CG CTTTGG CTGAAT GAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAAATGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
TCCAGCACACCTGGCGAGTAGTAGCTGTCAATAAATCTATGTAAACAGACAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 21(D) 

MAASLGQVLALVLVAALWGGTQPLLKRASAGLQRVHEPTWAQQLLQEMKTLFLNTEYLMPFL 
LNQCGSLLYYLTLASTDLTLAVPICNSLAI IFTLIVGKALGEDIGGKRKLDYCECGTQLCGS 
RHTCVSSF PEP I S PEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
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FIGURE 21111 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGAC C CTG CCAGGCTG TCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACCATGAAGGGCATCCTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 
TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTC CT CAG C CAGC TCCTCTCTAGAGACACCAGT CAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALiASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 
GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCASaGTCCCCGCCGCCG 
GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 
ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 
CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 
ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 
ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 
GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 
ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 
ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 
CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 
CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 
TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 
TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 
CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 
GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 
AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 
GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 
GACAGAAACCAGAGGSftATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 
CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 
CACAA 
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FIGURE 214 

MVPAAGALLWLLJ^LGFKAAUAUG^ 

ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGVVINAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 
FWKRVRIGLEDIWNSLSSVFTEMQPIDRNQR 
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CCCGGGTCGACCCACGCGTCCGGGGAGAAAGG&ISGCCGGCCTGGCGGCGCGGTTGGTCCTG 
CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 
CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 
GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 
ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGCAA 
GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 
ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 
ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 
CTTCCACACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 
TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 
AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 
CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 
GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 
GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTG 
GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 
GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGGAATCAGAGGACAAGTTCAAGCTGGAC 
TQAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 
CTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 
TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 
CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 
GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 
CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 
GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCACCCCCCAC 
CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 
GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 
GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 
CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCG 
GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 
GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 
AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 
GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 
ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 
CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 
TGCCCCTTTGCCCGCCTCCTGCAAACCTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 
CAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 
CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCTGCTGCCAGCCCC 
TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 
CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGC 
CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 
AGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCCCTTGTCCCTCT 
TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTGGCCTGCGCTAGCTT 
CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 
CCAAGCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

MAGLAARLVLLAGAAALASGSQGDREPVYRDCVLQCEEQNCSGGALNHFRSRQPIYMSLAGW 
TCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFLNGLASLVMLCR 
YRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVI LHS I YLCCVR 
TVGLQHPAWSAFRALLLLMLTVHVSYLSLIRFDYGYNLVANVAIGLVNVVWWLAWCLWNQR 
RLPHVRKCVVWLLLQGLSLiLELLDF PPLFWVLDAHAI WH I STI PVHVLFFSFLEDDSLYLL 
KESEDKFKLD 
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FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
OTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGCTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 
TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 
CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTC CAAT ATGAGGAC AAGT TC CGAAATAATTTGAAAGGC AAAAGGCTGGATATCAACAC CA 
AC ACCTACACAT CT CAGG ATCT CAAGAGTG CACTGG CAAAATTCAAGGAGGGGGCAGAGATG 
GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATG&GGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 
TGGC CATT AAATGGAAAC CTGAAGG C CAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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FIGURE 218 

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEVFHPTHEWQALQPGQAVPAGSHVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIETDMQIMV 
RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGAAFSSNPKVQVEAI EGGALQKLLVI LATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEKMFAEEEAELTQEMSPEKLQQYRQVHLLPG 
LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FICIME 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 
CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 
CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 
TTCCCACCTGACCAGCCATOGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGC 
CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 
CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 
TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 
GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 
TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCAC CCATCTCCATC CGCCAGATGGCCT 
ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 
GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 
AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 
CCTGTGAG AG GAGACGGT ACTGGGCTTTGGGCCTGGTGGTTGGGAGTCAC CTACTGAC AT CG 
GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 
TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 
TCTTGTGTAAGGACIS&CTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 
CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 
GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 
TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 
CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 
CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 
GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 
CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 
GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 
GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 
AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 

GAGGTGGGGGGAGGGAGGAGGT ATATTGGAACT C TT CT AAC CT C CTTGGG CTATATTTTCTC 
TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 
GGGGGAAAGGAAGGAAGTGC ATGTTTGGGAACTGG CATTACTGGAACTAATGGTTTTAAC CT 
CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 
CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 
GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 
ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 
AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 



WO 99/63088 




PCT/US99/12252 



FIGURE 220 

MGAAVFFGCTF'VAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASWWFILVHVTDR 
SDARLQYGLLI FGAAVSVXiLQE VFRFAYYKLLKKADEGLASLSEDGRS P I S I RQMAYVSGLS 
FGIISGVFSVINILADALGPGWGIHGDSPYYFLTSAFLTAAIILLHTFWGWFFDACERRR 
YWALGLWGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQRSLLCKD 
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FIGURE 221 

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAG CCATTATCCTG CT C 
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FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGAT CACCCATTTCCATCCGCCAGATGG CCTATGTTTN 
TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNTCTGTTATCAATATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
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FIGURE 223 

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 
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FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 
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FIGURE 225 

GC C C CAGGGAGCAGTGGGTGGTTATAACTCAGGC CCGGTG C C CAGAGC CCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATOGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAAAA 
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FIGURE 226 

MATARPPWMVWIjCALITALLL 

SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 227 

ATGG TCAACGACCGGTGGAAGACCATGGGCGGCGCTGCCCAACTTGAGGACCGGCCGCGCGA 
CAAGCCGCAGCGGCCGAGCTGCGGCTACGTGCTGTGCACCGTGCTGCTGGCCCTGGCTGTGC 
TGCTGGCTGTAGCTGTCACCGGTGCCGTGCTCTTCCTGAACCACGCCCACGCGCCGGGCACG 
GCGCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGCCAACAGCGCCCTGGTCACTGTGGA 
AAGGGCGGACAGCTCGCACCTCAGCATCCTCATTGACCCGCGCTGCCCCGACCTCACCGACA 
GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGACAGAGCACCAGGCC 
CAGCCACGGCTGGTGGGCGACCAGGAGCAGGAGCTGCTGGACACGCTGGCCGACCAGCTGCC 
CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGGGGCTGCGGAAGGGGCATG 
GC ACGCTGGGCCAGGGCCTC AG CGCCCTGCAGAGTGAGCAGGGCCGCCTCATCC AG CTTCTC 
TCTGAGAGCCAGGGCCACATGGCTCACCTGGTGAACTCCGTCAGCGACATCCTGGATGCCCT 
GCAGAGGGACCGGGGGCTGGGCCGGCCCCGCAACAAGGCCGACCTTCAGAGAGCGCCTGCCC 
GGGGAACCCGGCCCCGGGGCTGTGCCACTGGCTCCCGGCCCCGAGACTGTCTGGACGTCCTC 
CTAAGCGGACAGCAGGACGATGGCGTCTACTCTGTCTTTCCCACCCACTACCCGGCCGGCTT 
CCAGGTGTACTGTGACATGCGCACGGACGGCGGCGGCTGGACGGTGTTTCAGCGCCGGGAGG 
ACGGCTCCGTGAACTTCTTCCGGGGCTGGGACGCGTACCGAGACGGCTTTGGCAGGCTCACC 
GGGGAGCACTGGCTAGGGCTCAAGAGGATCCACGCCCTGACCACACAGGCTGCCTACGAGCT 
GCACGTGGACCTGGAGGACTTTGAGAATGGCACGGCCTATGCCCGCTACGGGAGCTTCGGCG 
TGGGCTTGTTCTCCGTGGACCCTGAGGAAGACGGGTACCCGCTCACCGTGGCTGACTATTCC 
GGCACTGCAGGCGACTCCCTCCTGAAGCACAGCGGCATGAGGTTCACCACCAAGGACCGTGA 
CAGCGACCATTCAGAGAACAACTGTGCCGCCTTCTACCGCGGTGCCTGGTGGTACCGCAACT 
GCCACACGTCCAACCTCAATGGGCAGTACCTGCGCGGTGCGCACGCCTCCTATGCCGACGGC 
GTGGAGTGGTCCTCCTGGACCGGCTGGCAGTACTCACTCAAGTTCTCTGAGATGAAGATCCG 
GCCGGTCCGGGAGGACCGCSAOACTGGTGCACCTTGTCCTTGGCCCTGCTGGTCC CTGT CG C 
CCCATCCCCGACCCCACCTCACTCTTTCGTGAATGTTCTCCACCCACCTGTGCCTGGCGGAC 
CCACTCTCCAGTAGGGAGGGGCCGGGCCATCCCTGACACGAAGCTCCCTGGGCCGGTGAAGT 
CACACATCGCCTTCTCGCCGTCCCCACCCCCTCCATTTGGCAGCTCACTGATCTCTTGCCTC 
TGCTGATGGGGGCTGGCAAACTTGACGACCCCAACTCCTGCCTGCCCCCACTGTGACTCCGG 
TGCTGTTTGCCGTCCCCTGGCCAGGATGGTGGAGTCTGCCCCAGGCACCCTCTGCCCTGCCC 
GG CCAAATACCCGGCATTATGGGGAC AGAGAGCAGGGGG C AGACAGC AC CC CTGGAGTCCT C 
CTAGCAGATCGTGGGGAATGTCAGGTCTCTCTGAGGTCAGGTCTGAGGCCAGTATCCTCCAG 
CCCTCCCAATGCCAACCCCCACCCCGTTTCCCTGGTGCCCAGAGAACCCACCTCTCCCCCAA 
GGGCCTCAGCCTGGCTGTGGGCTGGGTGGCCCCATCCTACCAGGCCCTGAGGTCAGGATGGG 
GAGCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCTGGAGGCCACCCAC 
CCTGTGCCCCGGCAGGCCTGGGGTCTGCAGTCCTCTTACCTGCTGTGCCCACCTGCTCTCTG 
TCTCAAATGAGGCCCAACCCATCCCCCACCCAGCTCCCGGCCGTCCTCCTACCTGGGGCAGC 
CGGGGCTGCCATCCCATTTCTCCTGCCTCTGGAAGGTGGGTGGGGCCCTGCACCGTGGGGCT 
GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCTAGGCAGGGCTGGGATGAG 
GCTTGTACAACCCCCACCACCAATTTCCCAGGGACTCCAGGGTCCTGAGGCCTCCCAGGAGG 
GCCTTGGGGGTGATGACCCCTTCCCTGAGGTGGCTGTCTCCATGAGGAGGCCAACCCTTGCC 
ATTGACCGTGGCCACCTGGACCCAGGCCAGGCCCGGCCCGGCGAGTGGTCAAGGGACAGGGA 
CCACCTCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGACCAGGCACCACCTGGACA 
CTTTCTTGTTGAATCCTCCCAACACCCAGCACGCTGTCATCCCCACTCCTTGTGTGCACACA 
TGCAGAGGTGAGACCCGCAGGCTCCCAGGACCAGCAGCCACAAGGGCAGGGCTGGAGCCGGG 
TCCTCAGCTGTCTGCTCAGCAGCCCTGGACCCGCGTGCGTTACGTCAGGCCCAGATGCAGGG 
CGGCTTTTCCAAGGCCTCCTGATGGGGGCCTCCGAAAGGGCTGGAGTCAGCCTTGGGGAGCT 
GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTCCAAAGGACACCCGATGGCA 
GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCACTGAAGTTTGTGCTTAAAA 
AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCGTGTGACCTGGCTCTC 
TGTCCCAGTGCCACCAGGTCAT C CACATGCGCAG 
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FUGURE 22n 

MVNDRWKTMGGAAQLEDRPRDKPQRPSCGYV^ 

APPP WSTGAASANSALVTVERADSSHLS I L I D PRC PDLTDS FARLES AQ AS VLQALTEHQA 
QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 
SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 
LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 
GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 
GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWS SWTGWQY SLKF SEMK I R PVREDR 
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FIGURE 229 

GCAGTCAGAGACTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 
TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 
CACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGCTGGATGATG 
ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 
CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 
TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 
AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 
CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 
TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 
AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG 
GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 
CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 
TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 
TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTO&TTCGCC 
CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 
TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 
AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 
GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 
GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 
ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 
AG CAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTT CATACTAGT CC CTT C C CAAT C CAT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 
AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 
TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 
AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTCA 
TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 
GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 



WO 99/63088 „_ / / PCT/US99/12252 

FIGURE 23dD 

l^AKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHI I IDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACATGNTGG 
ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 
AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 
CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 
CTAC CAGCTCTCCAATACTG GT CAAG ACAC CATTT CTCAAATGGAAGAAAGATTAGGAAATA 
CG TC CC AAGAGTTGCAATTTNTT CAAGT C C AGAATAT AAAGCTTGCAGGAAGTNTGCAGCAT 
GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 
AGTNTCCTCATNTACTATACACACACCACTTCCC 
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FIGURE 232 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATGCGGGCATCGCTGCTGCTG 
TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 
AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
CTACCACCCTGCACAGCCCGGCCAGGCCG CCAAAAAGGC CGTC AGGACCCGCTACATCAGCA 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 
ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 
TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 
GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 
CTGCAACAACTGCGCCCCCACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 
AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACGGGGCAGCTGCTTGGCCCGTGGGTATT 
CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 
GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 
ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 
CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGCAGACAGCCGCACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 
TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 
CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 
TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 
CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 
GTTCCTCCACTTCTCCAGTCTGCATGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 
ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 
GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 
CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGCCACACCCCACCCCACTT 
CTCCCCCAAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 
AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 
GGACGTGCCCCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 
GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 
CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 
GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAACTGGCCA 
AGTGTGGAAAAA 
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FJ IOTTO 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRYISTELGIRQRLLVAVL 
TSQTTLPTLGVAVNRTLGHRLERWFLTGARGRRAPPGMAWTLGEERPIGHLHLALRHLLE 
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLiASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 
VLLS RMLLQQLRPHLEGCRND I VS ARPDEWLGRC I LDATG VG CTGDHEGVHYSHLELS PGE P 
VQEGDPHFRSALTAHPVRDPVHMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDGDRA 
AAWPVG I PAPSRPASRFE VLRWD YFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
LPVPYVTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 
HAJDVFAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLIiAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
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GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 
TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 
AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 
ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 
CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 
GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 
TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTCATGGCGGCTGGGCTGTTTGGTTTGAGC 
GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 
ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 
CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 
GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 
CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 
GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 
ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 
CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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FIGURE 235 

MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTVVAPSAVAGKRPPEPTTPWQEDPEPE 
DENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVS I ILVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
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GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GT GG CGGGAAAGCGGCCCCCAGAACCG AC CACACCGTGGCAAG AG GACCCAGAACCCGAGG A 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 



WO 99/63088 / PCT/US99/12252 

FIGURE 237 



GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 
TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 
ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 
TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCAGCTGATCTCCAAGTATTCTCTACGGGA 
GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 
TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAACTTCATCGA 
CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACTG 
ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 
CCOTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 
TCGCTTGTTC CACACCAG CTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATCACGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 
GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 
CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 
ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 
AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 
GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 
ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 
GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 
CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 
GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 
GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 
GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 
GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 
GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 
CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 
TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 
CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 
CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 
AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 
GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 
TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
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FIGURE 238 

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFCFRTRWDSELQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTQGFWRTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWK 
ELSNVLSGIFCASLNFIDSTNTVTPTASFKPLGLANDTDHYFLRYAVLPREWCTENLTPWK 
KLLPCSSKAGLS VLLKADRLFHTS YHSQAVH I RPVCRNARCTS I SWELRQTLS WFDAFITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFPVLLLDTVPWYLRLYVHTLTITSKGKENKPSYIHYQPAQDRLQPHLLEMLIQLPANS 
VTKVS I QFERALLKWTEYTPDPNHGF YVS PSVLSALVPSMVAAKPVDWEE S PLFNSLF PVSD 
GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTWAVCYGSFYNLLTRTFHIEEPRTGGLA 
KRLANL I RRARGVP PL 
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FIGURE 239 

CAAC ATGG GGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG 
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTT GC CCAG CTGACAACGTACGC 
TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 
ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGC C AGGATGGGAGG C CAAGTGT CCAGG C 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAAAGAAAAC CTGAGCTTGAAGTCCTTTTC CC CAAAAAGAGGGAAGAGTCACAAAAAG 
TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
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FIGURE im 

MGSS SFLVLMVSLVIjVTLVAVEGVKEG i ekagvcp adnvrcfksdp pqchtdqdclgerkcc 

YLHCGFKCVI'PVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 
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FICURE 241 

AAACTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 
AAACCTGAGCGGTGAGACTCCCAG CTGCCTACAT CAAGGCCCCAGGAC ATGCAG l AACCTTCC 
TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 
CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 
TTAAGGAGC CTCAAACAAAGCCTT CCAGGCAT CAAC GC AC AG AGAACATTAAAGAAAGGTCT 
CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 
TGCAGAGCCAGCGCCAGAGAAC AATGCCCT CAACAC ACAAAC C CAG C C CAAGGC C CACACCA 
C CGGAGACAGAGGAAAGGAGGCCAAC CAGGCACCGC CGGAGGAG CAGGACAAGGTGCC CCAC 
ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 
ACCC AGAGGG CAAGATGCAGGGATGGC CT CTGGCAGGACAGAGGCACAAT CATGGAAGAGCC 
AGGACACAAAGACGAC CCAAGGAAATGGGGGC CAGACCAGGAAGCTGACGG C CTCCAGGACG 
GTGT CAGAGAAGCACCAGGGCAAAG CGG CAAC CACAGCCAAGACGCTCATTCCCAAAAGTCA 
GCAC AGAATG CTGGCTCC CACAGGAGCAGTGT CAACAAGGACGAGACAGAAAGG AGTGACCA 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 
GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 
TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 
TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 
CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 
TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 
GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 
CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 
GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 
CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 
CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 
TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 
TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 
TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 
AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 
CACTAC TATGAT ACAT CATGGAAGCGGCTGATCTTTTACATAAACCATGACT T C AAG C TGGA 
GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 
CCGGAACTGCCAAAGCCAAGAACTOACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 
CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 
GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 
GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 
GATTTTTAATTAAATGGGGTGATGGGTGGCCAAT AC CACAATTC CTG CTGAAAAACACT CTT 
CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATT^ 

GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 
AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 
TCITTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 
GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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FIGURE 242 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 
KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 
KAATTAKTLI PKSQHRMLAPTGAVSTRTRQKGVTTAVI PPKEKKPQATPPPAPFQSPTTQRN 
QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLLASLPAGSLRCITCAWGNGG 
ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFL 
RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRLIFYINHDFKLEREVWKRLHDEGIIRLYQRPGPGTAKAKN 
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FIGURE 243 

CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 
GAGC AGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTT T C TAAAAC AC CT 
TACAGGCCCTCITTATTTTAGTCCAAAGTGCAGCAAACACTTCCATAGACTTTA 
CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 
GTCAGTCCAGTGTGCATGGAGGATAAGTGAG CAGAC CGTACAGGAG CAG CACACCAGGAG C C 
ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 
ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 
ACAGTTTGCAGATATATATT CGATAAATCAGTGTACTTGACAGTGTTAT CTGTCACTTATTT 
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FIGURE 244 

MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCTI PAYYKRCARLLTRLAVSPVCMEDK 
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FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATOCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 
CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 
GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 
CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCG CCGGCG C C AGCAGCTC CAGAGC CCATTTGAAGG C CAGGAGATTCCAATGAC AGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGT CTACAACC CTGCAGCTCCT CCT C CCTATATGCCACCACAG CC CT CTTAC C CGGGAG C C 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 
GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 
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FIGURE 246 

MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLYRRRQQLQSP 
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 
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FIGURE 247 

GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGAI5GTGTTTCTGCCC CT CAAATGGTCCCTTGCAACCATG 
TCATTTCTACTTTCCTCAOTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 
CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGT CAC C AC CT GC AGATAT CTAGGG C CACCCTCAGGAAGGGAGCTGGAGAGAGGCTAT CGG 
AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 
CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 
TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 
CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 
AAAG CAAGTTTCT CAAT CAAAATTAGAAGAGAG CC AAGG CA C CTAGCCATCTC CAATATGCC 
ATTGGTGAAATCTGTGACTGTTGCTGAAQGACTCATAGAAGACCATTTTGATGTCACTGTGA 
AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 
AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 
ACTGGATGCTGCGGTG ACTCTTCTAGAATTTTATGAGGATTAT TT CAG C ATAC CGTATC CC C 
TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA 
CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA 
GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 
CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 
TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 
CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 
CTCAGATCCGGGAGATGTTTGATGATGTTTCTT ATGATAAGGGAG CTTGTATT CTGAATAT G 
CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 
TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 
TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 
AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCAAATCCAACATGGTCCATCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 
AGGATGATGGATGGGACTCTTTGACTGGCCTTTTAAAAGGAACACACACAGCAGTCAGCAGT 
AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 
TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 
AAGGTTTGAATG AG CTGATT CCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 
GAAACTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 
GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 
TGCACAACTAT CAGC CGTGCGTACAGAGGGCAGAAGGCTATTTC AGAAAGTGGAAGGAAT CC 
AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 
CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 
GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 
GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 
CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG 
TAa^AAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTAC^CAAATC^ 
TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 
TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 
ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATGT^AAAA 
TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 
ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 
CTATCCCTGTGAAAAGAATAG CTGTT AGTT TTT CATGAAT GGGCTTTTTCATGAATGGGCTA 
TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 
TCCCTGCCAC^GAAGAATAAAGTACCTTATTCTTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGUEE 248 

IWFLPLKWSLATMSFLLSSLLALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEYVIPVH 
YDLL I HANLTTLTFWGTTKVE I TASQ PTSTII LHSHHLQ I SRATLRKGAGERLS EE PLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA 
ARMAFPCFDEPAFKASFSIKIRREPRHLAISNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAA 
IPDFQSGAMEOTGLTTYRESALLFDAEKSSASSKLGITVTVAHELAHQWFGNLVTMEWWNDL 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVSYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDG 
FCSRSQHSSSSSHVmQEGVDVKTMMNTWTLQRGFPLITITVRGRr^HMKQEHYMKGSDGAPD 
TGYLWHVPLTFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSL 
TGLLKGTHTAVSSNDRASLINNAFQLVSIGKLS I EKALDLSLYLKHETEIMPVFQGLNELI P 
MYKLMEKRDMNEVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLACVHNYQPCV 
QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 
SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 
VWLQSEKLERM 



WO99/630S8 / PCT/US99/12252 

FIGURE 249 

CAGCCACAGACGGGTCATQAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC 
TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 
GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 
CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 
AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 
TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 
TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 
AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 
GG CCTCCTCAGGCTCAGGGGAGGAGGCATCTTCT CCAAT CTGAGAGTC CAGGGATG CATG CC 
CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 
ACTGCAAT AGGAAAGATTTT CTGAC CTGTCATCGGGGGACCAC CATTAT GACACACGGAAAC 
TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 
GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 
GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 
GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 
CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 
CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 
GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 
GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 
GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 
GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 
GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 
CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 
GCTGCATGTATCTGATAATACAGACC CTGTC CTTTCA 
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FIGURE 2§ti) 

MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGL^ 

ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 

PADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCN 

LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 

LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 

SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 

VAQPSS FLLNHTRQ IGI FSAREKRDVQPPASQHEGGGAEGLES LTWGVGIaALAPALWWGWCPS C 
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CTCUMK 251 

GCGAOGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 
CAGG&T^GGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCTGGCCTTCCTGTCACTGCTG 
CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 
CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGAGTCG 
GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGA CAGAAAGGCAGT GTGGGTCGTCAT 
GGAAAAATTGGTCC CATTGG CT CTAAAGGTGAGAAAGGAG ATT C CGGTGACATAGGACCCC C 
TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 
AGATGGACAAC C AGGT CT CT CAGCTGAC CAGCGAGCT CAAGTTCATCAAGAATGCTGTCGC C 
GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA 
CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 
ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 
GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 
GTGGCGCAGCGGTGAGCCCAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 
CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 
GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 
GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 
GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 
AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 
ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 
TATACAATAAAAT CTTTAAGTAGTGCAGTAGTTAAGTC CAAAAAAAAAAAAAAAAAAA 



WO 99/63088 ^ -^^j ^ PCT/US99/12252 

FIGURE 252 

MRGNLALVGVLISLAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAAN 
GLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVAS 
GGWNDVACHTTMYFMCEFDKENM 
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FIGURE 25 3 

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 
CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGACCACC 
CAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATQAGGATCATGCTGCTATTCACAGCCAT 
CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 
TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 
AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 
GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 
GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 
CT TCAT CCCAATCAGCTTGGATCCACAGGAAAGT CTT CC CTGGGAACAGAGGAGC AGAGACC 
TTTAJ^GACTCTCCTACGGATGTGAATCAAGAGAA 

AT CCCCCGAGAGCAGAATAGGTACTCCACTTC CGGACTCCTGGACTGCATTAGGAAGACCTC 
TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 
AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 
TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 254 

MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 
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FIGURE 2§5 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGCTGCAGGAGTGGAGCCaiSAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC 
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCTCATGCTGCACAACAAGCTTCGGGGCCAGGTGCAGCCTCAGGCCT 
CCAACATGGAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCCTG 
AAACC TTAGA CTCCCGGGGTTAAGCGATCCTGCTTCAGCCTCCCAAGTAGCTGGAACTACAG 
GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGATGGGGGTCTTGCTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC 
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCC AGCCTGGGTGACAGAG CAAGAC CCTGTCTCAAAAA 
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MGTOE 256 

MS CVLGGVI PLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAI PREDKEE ILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
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FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGG 
GTCTGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 
GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 
CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGCAACACAJSACAGCCATTGAAGCCTG 
TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 
CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHWCNT 
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FIGURE 259 

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 
CAAAAACAACAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGCAGT 
T&SSATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 
TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 
TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCTAATCAATGCAGAGTTCCCCTCCC 
CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGGAAAAAGGCATGTATTTAAATCTG 
TATGATTCT CAACCAT CTTTAGTTGGGAAAGGTC CTTGAAAG C CAATGGAAATACTTTTTTT 
TTTTCTTGGCACTAATCAAGTGAGTGTTACCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 
GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 
ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 
TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 
AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 
TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 
ATAAAAGTTCATATCTACCC 
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FIGURE 26(D 



MIGVTLILFLMWGSSTVFCVLLIFTIAEASFSVENECLVDLCLLRICYKLSGVPNQCRVPLP 
SDCSK 
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FIGURE 261 



PCI7US99/12252 



GAGGATTTGC CACAGCAGCGGATAGAGC AGGAGAGC AC CACCGGAGC C CTTGAGACAT CCTT 
GAGAAGAGCCACAGCATAAGAGACTGC CCTG CTTGGTGTTTTGCAGGATQATGGTGGC CCTT 
CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 
CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 
AAAAATGT AC C C AAG CAACGAGGGCAT ACATTC AAGAATT C CAAG AGTT CT CAAAAAAT AT A 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 
GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 
ACGAGTGC AT CGTATCAG AGGACAAGACACTGG CAGAAATGTTG CTCC AAGAAGCTGAAGAA 
GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 
TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 
ACTCTCCAAAGGTGTACTTATTAATTGGATCCA.GAAACAACACTGTTTGGGAATTTGCAAAC 
ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 
CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 
CTAATGAGATAAT CAAATATAAC CT GCAGAAGAGG ACTGTGGAAGATCGAATGCTGCTCCCA 
GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 
GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 
CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA 
GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 
CCAACTTGTTCTTCC CCAAGAGACCAAGAAGT CACT CC ATGATC CATTACAAC C C CAGAGAT 
AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGAT CATTTACAAACT C C AGAC AAAGAGAAA 
GCTGCCTCTGAAGSMTGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC 
TACAGGACAGTGAGG CTATAGC CC CTTC ACAATATAGT AT C CCTCTAATCACACACAGGAAG 
AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 
CAAGAGCTTAGATGAGAGCAT AT CAT C AGGAAAGTTTCAAC AATGTC CATT ACT C CC CCAAA 
CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 
GCTCCCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 
CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 
CAAATGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 
CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 
AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 
CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTT AA CCT CTATTT CC C CTAG CC CTGT C 
CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGAATATCGCTTT 
CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 
TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGATACAGC 
TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 
TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 
TTTAGCC^GTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC 
AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGCTGTCCTCTGT 
TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 
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MMVALRGASALLVLFIAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
FSKNISVMLGRCQTYTSEYKSAVGNLALRVERAQREIDYIQYLREADECIV3EDKTLAEMLL 
QEAEEEKKIRTLLNAS CDNMLMG I KS LKI VKKMMDTHGS WMKDAVYNS PKVYLL I GSRNNTV 
WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKTNLQKRTVED 
RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 
PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQLYAWNEGNQ I I YKLQTKRKLPLK 
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FIGURE 263 

GGGCGCCCGCGTACTCACTAGCTGAGGTGGCAGTGGTTCCACCAAC&TgGAGCTCTCGCAGA 
TGTCGGAGCTCATGG<^CTGTCGGTGTTGCTTGGGCTGCT^CCCTGATGGCGACGGCGGCG 
GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 
AAATGGATTTC CACCTGACAAAT C TTCGGGAT C C AAGAAGCAGAAACAAT AT CAG CGGAT T C 
GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 
CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATAGCTGGCTACCTGTGC 
AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 
TGAGAGC CAACGTGGAGCTGGAC CACGC CAC C CT GG TG CG CTTCAGCCCTGACTGCAGAG C C 
TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 
TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 
TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 
CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 
ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 
AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 
CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 
TTCTGT CTCCAAGGATGGTACATGGAAACTGTGGGACACAGATG TGGAATACAAGAAGAAG C 
AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 
CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 
CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 
ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 
TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 
CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG 
GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGAAGG 
AGTCTCCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 
AGGAGAGAGAGAGAGAATGTGATTTTTGG C CTTG TGGCAGCACATCCTCACACCCAAAGAAG 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 
TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 
TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 
TTAGAAACACATTCCTTGGGAAGGC^AAGTTTTCTGGGACTTGATCATACATTTTATATGGT 
TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 
TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 264 

MELSQMSELMGLSVLLGLLALMATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKKQK 
QYQR I RKEKPQQHNFTHRLIiAAALKS HS GN I S CMD F S S NG K Y LATC ADDRT I R I WSTKDF LQ 
REHRSMRANVEL^HATLVRFSPDCRAFIVWLANGDTLRVFKMTKREDGGYTFTATPEDFPKK 
HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 
FTPDVKVWEVCFGKKGEFQEWRAFELKGHSAAVHSFAFSNDSRR^4ASVSKDGTWKLWDTDV 
EYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTRRGEKEECFERVH 
GECI ANLSFD I TGR FLAS CGDRAVRLFHNTPGHRAMVEEMQGHLKRASNES TRQRLQQQLTQ 
AQETLKSLGALKK 
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TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 
CAGTGTTTTGCCTTCACCCCAAGTGACCATQAGAGGTGCCACGCGAGTCTCAATCATGCTCC 
TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 
GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 
GGGGCGGGAAGGGGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 
AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 
CGCTGCTCCATGGACTTGAAGAACATCAATTTTS^GCGCTTGCCTGGTCTCAGGATACCCA 
CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGAC 
TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 
GCAGACAT AC CTC C CATCATGACATGGT C C C CAGGCTGGCCTGAGGATGTCACAGCTTGAGG 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 
AATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 
CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 
GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 
ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 
TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 
GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 
CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 
TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 
TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 
TGGGAG CAAGGGACAGGGAG CAGGGCAGGGGCTGAAAGGGGC ACTGATTCAGAC CAGGGAGG 
CAACTACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAAA^ 
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FIGURE 266 

MRGATRVS IMLLLVTVSDCAVITGACERDVQCGAGTCCAI SLVfLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
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FIGURE 267 

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAAIfiTCTTTC 

CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 

TGCTGCCTTGGCATTGCTGCTTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 

CCCTGGAGTACCTGGAGGATAT AG AC CTGAAAAC ACTGGAGAAG GAAC CAAGGACTTTCAAA 

GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 

CCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 

TCCCCCTCTATG CAGTGGTAAAGGAG CACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 

TTCAAAGGAGAAATCTTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 

GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 

GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 

GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 

TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 

TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 

GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 

TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 

CAAAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 

TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 

TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 

TCTACTAAAAATACAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 

GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 

CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
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FIGURE 268 

MS FLQDPS FFTMGMWS IGAGALGAAALALLLANTDVFLS K PQKAALE YLED I DLKTLE KE PR 
TFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLIX2LGVPLyAVVKEHIRTEVKDF 
QPYFKGE I FLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGF I LGGVFV 
VGSGKQG I LLEHRE KE FGDKVNLLS VLEAAKM I KPQTLASEKK 
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FIGURE 2€9 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 
GGCCAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGA 
AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTG 
CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 
GACCACTTCTGCAAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATG 
GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 
CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 
CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 
ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCCATCTAgGTCCCCTCTCCTGCATCTGTCTCC 
CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 
TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 
GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 27(D) 



MANPGLGLLLALGLPFLLARWGRAWGQ I QTTS ANENSTVLPSSTS S S SDGNLRPEA I TAI I V 
VFSLLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 
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FIGURE 271 

AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 
TT^GGATTTTAATTTTCAAACACAGCAGAM^CATTTTTTCTGTCACTATTATTATTGTTG 
GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 
AT CAAGAAATAAAGAGAACCACAGTCAACCCACACAAT CATCTTTAGAAGAC AGTGTG ACT C 
CTACCAAAGCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 
GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAQATTCAATG 
ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATA 
TATACAATAAGGTG AAAT TATAAGTAC C CTATGC AGTTGG CTGGACAGTTCTAAATTGGACT 
TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 
TCATATAATTTGATACAAATAAAAGAAAAGTGTT C T C T C C C CTTACAGAATTGACATTTTAA 
ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 
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MOTTO 212 



MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
GIVKGRNLDSRGLILGAEAWGRGVKKNT 
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FIGURE 273 

nrCAGGAATAACTAGAGAGGAAC AATG GGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT 
GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 
TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAACAAATAGAGGAT 
ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTTTTTCAAAAA 
TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATG 
AAAAC CATAAACATGC TGATGTTATAGTTGCACC AC CT AC AC TC CC AGGTAGAGATGAAC CA 
TACACCAAGCAGTT CACAGAATGTGGAGAGAAAGGCGAAT ACATT CACT TCAC CC CTGACC T 
TCTACTTGGAAAAAAACAAAATGAATATGGACCACCAGGCAAACTGTTTGTCCATGAGTGGG 
CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 
TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA 
GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 
GAAAAGATTGTCAATTCTTTCCTGATAAAGTACAAACAGAAAAAG CAT C CATAATGTTT ATG 
CAAAGTATTGATT CTGTT GTTG AATTTT.GTAACGAAAAAACC CATAAT CAAGAAGCTCCAAG 
CCTACAAAACATAAAGTG CAATTTTAGAAGTACATGGG AGGTGATTAG CAATTCTG AGG AT T 
TTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATC 
AGTCAAAGAATTGTGTGC TTAGTT CTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCG CCT 
AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGG 
TGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTAAATAAGCTAATCCAAATAAAAAGC 
AGTGATGAAAGAAACACACTCATGG CAGG ATTACCT ACATATC CT CT GGGAGGAACTT CCAT 
CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGAT 
CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTG 
AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAAT 
AG AGATGAGCAAGATAACAGGAGGAAGT CAT TTTT ATGT TT CAGATGAAGCTCAGAACAATG 
GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAGTCCCTT 
CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAAT 
TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTA 
TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACTTCCAAA 
ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 
CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCTGTGC 
CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 
GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCAT 
TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 
CTTT CAAGAATGATGGAGTCTACT CC AGGTATTTTACAGCATATACAGAAAATGGCAGATAT 
AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 
GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAA 
GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 
GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAAGTCA 
AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAGCACCAG 
GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGA 
GGCCAACTCCAAGGAAAG CTTTGCATTTAAAC CAGAAAATAT CTCAGAAGAAAATGCAACCC 
ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTATCCAACATT 
GCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGATCCTACACCTACTCC 
TACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTAT 
TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTIS&ACCTTA 
ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 
AAAGGATATTTCTGAAT CTTAAAATT CATCCC ATGTGTGATCATAAAC TCATAAAAATAATT 
TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAACTGT 
CAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC 
AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAAT 
GATATTTCAAATTGCATCAAGAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAA 
GGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2741 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKI IEQIEDMVTTASTY 
LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 
GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 
CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWE 
FCNEKTHNQEAPSLQNIKCNFRSTVfEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 
I^KSGSMGGKDRLNRMNQAAKHFLLQTVENGSWGMVHFDSTATIVNKLIQIKSSDERNTLM 
AGLPTYPLGGTS I CSG I KYAFQVI GELHSQLDGSEVLLLTDGEDNTASS C I DEVKQSGAI VH 
FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 
LNSNAWMNDTVI IDSTVGKDTFFLITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPG 
TAKVGTWAYNLQAKANPETLT ITVTSRAANSSVPP I TVNAKMNKDVNS FP S PMIVYAE I LQG 
YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 
GANTARLKLRPPLNRAAYIPGWVVNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFVVSQV 
PSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDNFDVGKVQRYI IRISAS ILDLRDSFDD 
ALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIP 
QANPDD IDPTPTPTPTPTPDKSHNSGVNI STLVLSV I GSW I VNF ILSTTI 
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FIGURE 27S 

CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGG 
GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGAC 
GCCAAGGTAATTTTGACCCAGAGGAGCAATGATGTAGCCACCTCCrAACCTTCCCTTCTTGAACCCCC 
AGTTATGCC AGGATTTACTAGAGAGTGTCAACTCAACCAGCAAG CGGCTCCTT CGGCTTAACTTGTGG 
TTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCAGTGCTCAGAAGTGACTTGCCTGAGGGTG 
GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAA 
GGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAAAGTAGAGA 
AGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGC 
TCCGGGGGCCCCTiAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCT 
TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCT 
TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGT 
TGGCCTGCACCCCAAAAGGTGACGAGGAGCAG CTGGCACTGC CCAGGGCCAACAGCCCCACGGGGAAG 
GAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGAGCAGCCTGAAGCG 
GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAG 
CCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCrCCTGGCC 
TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGC 
AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCG 
AGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACC 
CTGAACAATCCTGCAGAGAACAGCC CCAATCACCGTCCTTACACGGCCTCTGATTT CATAGAAGGGAT 
CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAAT 
TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATG 
GCCAACACGCTTAT CAATGTTATCGTG CCTCTAGCAAAAAGGGTGGACAAGTT CCGGCAGTTCATGCA 
GAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGGAAAG 
AAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACC 
TT CATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGC CCG CTTCTGGAAGGG 
AAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTA 
GGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATA 
ATATACGGCCACCATGATGCAGTCCCTCCCTTGGAACAGCAGCTGGTCATAAAGAAGGAAACTGGATT 
TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTG 
AT CTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGT AT CTC CACAGCAACCTC 
ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAGCT 
GACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCAT CCCACGGCCAG CTGG 
GCATGCTGGTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAA 
AAAAC ATGA ACTCCCAGAGAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGC^TTACTGAAAGTG 
GCTGCAACAGAGAAAAGACTTCCATAAAGGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAA 
AGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGAAATCAAAATCTCCGCTTTGCCTGCAAAAGT 
AACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTG 
TGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATG 
ATTTAAGAGC^GTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTA 
TTATGATACTAGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCG 
TGTCATATTTTCCCCAAGATTAACGAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCT 
CCGTTTTTTTCTTTTATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTA 
CCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTACATTTT^ 

GAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATATCCAAGGACATGCCAAAT 

CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGA 

CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATG 

ACACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAG 

AAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGT 

GTGAACCAAACAATCTCTTTTCAAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATG 

GAGAAAAATATATATATATATATATATATTGTGAAAGATCAAT CCATCTGC CAGAAT CT AGTGGGATG 

GAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGC 

AGTTCTACTCAATCACC^GATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTTTAAA 

AATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAGATGCAT 

GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAA 

GAACATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTT 

AATTAAAACACAGGCCATGAATGGAAGGTGGTATTGCACAGCTAATAAAAT^ 
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^4M^^7RRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQ 
EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLIiAFL 
HSQVDKAEVNAGVKI.ATEYAAWFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES 
ALETLNNPAENS PNHRP YTASDF I EGI YRTERDKGTL YELTFKGDHKHEFKRL I LFRPFS P I 
MKVKNE KLNMANTL I NV I VPLAKRVDKFRQFMQNFREMC I EQDGRVHLTWYFG KEEI NE VK 
GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRDFGFGMTCQYRSDFI 
NI GGFDLD I KGWGGEDVHLYRKYLHSNL I WRTPVRGLFHLWHEKRCMDELTPEQ YKMCMQS 
KAMNEASHGQLGMLVFRHE I EAHLRKQKQKTSSKKT 
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FIGURE 277 

GAAAGA ATG TTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCTGAACTCTGTCAACC 

AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGT ATCAGAACAGCT CTGGGAGATAAAGCAT 

ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 

GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 

ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 

AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACT 

CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 

GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 

C AGGGATCTGGCAACGTAGAAGAAAGAACAAAGAAC CAT CTGAAGTGGATGACG CTGAAGAT 

AAGTGTGAAAACATGATCACAATTGAAAATGGCAT CC CCTCTGATCCC CTGGACATGAAGGG 

GGGCATATTAATGATGCCTTCATQACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT 

TGTTCTGCTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAAATA 

CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 

TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 

TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 

TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 

AAAGAC CTAAGGAAAAATAAATTTTC CAGTGG AGAAT ACATATAAT AT GGTGTAGAAATCAT 

TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG 

TGAGAAGTAATTATTGTAAATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATT 

TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 

AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCT 

T AATAAT CAT CTCTTTTT AAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 278 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVP 
NREATEISHVLLCNVTQRVSFWFVVTDPSKNHTLPAVEVQSAIRMNKNRiraAFFLhTDQTLE 
FLKIPSTLAPPMDPSVPIWI IIFGVIFCII IVAIALLILSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGI PSDPLDMKGGILMMPS 
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FTIGUME 279 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG 
GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTC^IST 
ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 
CCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGC 
TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 
ATTTTTOTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 
CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 
TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 
AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 
CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 
AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 
ATAAAGTGGTGG AGATAAAATCAAAAGAAGAGGAAAGG CTCAAC CAAGAGAAAAAGGT CT CT 
GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA 
GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 
TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 
AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 
GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAA 
GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 
AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 
TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 
TC CT TAACTATGAGACACATCTTGTT TTACTGAATTTCTTTCAATATTCC AGGTGATAGATT 
TTTGTCG 
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FIGURE 2M 

MYGKSSTRAVLLLLGIQLTALWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVT 
WNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQ 
VKNPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACAL^ 
AHKWEIKSKEEERLNQEKKVSVYLEDTD 
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GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 
ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAAC^GCTGCTCCAGCTG 
ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 
GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 
TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCC1 
GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTC 
ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 
ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
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FIGURE 282 



MKFLAVLVLLGVS IFLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKD I PVL PKWVGDLPNGRVCP 
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FIGURE 283 



GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 
CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAATGC 
TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 
GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 
ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 
GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCCAGTCCAGGGTGGGGGGCG 
GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCGGCCACAGAGTCATCTGCCCAGCTCCT 
CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 
ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTCTTGCTCTCAGGA 
CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGAAAT 
GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 
CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 
TACAAAAAATTAG CCAGGCACAGTGGTGTGCACTGGTAGTCC CAGTTACT CGGGAGGCTGAG 
GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 
TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
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MLPPALPPALVFTVAWSLLAERVSWVRDAEDAHRLQPFVTERTLGKVQRWSGVHTQTGGRAG 
GGQFCCAWLDSKRVLASPGWGAANS I KNQRVWAPATESSAQLLCCWPVGVARGGALCQ 
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FIGURE im 

GTCATOCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCGGCCTGCCTCA 
GCGGCCCCCATGGGCGGCCGAGAACTGGGACAGCATGAG^ 

GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 
AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 
CGGGGCCGGGATGCAG CC CAGGAACTTCGGGC AAGC CTGTTGGAGACT CAGATGGAGGAGGA 
TATTCTGCAG CTGCAGGCAGAGGC CACAGCTGAGGTGC TGGGGGAGGTGGC CCAGGCACAGA 
AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 
GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 
CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 
AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCCTGAATCTGCCTGGATGGAACTGAG 
GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 
CCTGTTCACTGGGAT CAG C CAGGG CGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 
GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 
CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCGTGGCATTTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGWE 286 

MPVPALCLLWALAMVTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVYRTTEGRLTK 
ARNSLGLYGRTIELLGQEVSR'GRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLRDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 
IQERLHTAALPA 
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FIGURE 287 



GGCAAC&ISGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 
CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGCCAGGA 
AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 
TGGACAGAAGT CAATGC CTTGAAGGAAATTCAAG CC CTGCAG AC AGTCTGTCT CCG AGGC AC 
TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 
AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 
CTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 
CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 
ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 
GG CAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATG CG AGTTC AC CATC C C 
TAA ATAG GTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA 
TC TCTAAG ATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATTGCAACACAAGAT CAAT 
GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAACACATTTCTTTGGGATTTTGCCCT 
TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 
AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 
AAATGACAGTAGCAATGCAACTC^TCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 
TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 
GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 
TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 
GC CATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 
TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 
CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 
CAAAGAAGCAGATTGCATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 

TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 
TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 
TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 
TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 
AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 
CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 
ATAATAAAGCCTGAATTCTGATCAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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HAQQAC FRAMAXNGLY I C I LV I TLLLDQTTSKTS RLKARrCKS KRRVRD KuGu LKTQ I E KLWT 

EVNALKEIQALQTVCTiRGTKVHKKCYLASEGLKHFHEANEDCISKGGILVIPRNSDEINALQ 

DYGKRSLPGVNDFWLGINDMVTEGKFVDVN^ 

WSDEACRSSKRYICEFTIPK 
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FIGURE 289 

GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 

CCCGAGCCCCCGCGCC^ISAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA 

GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 

GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 

GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 

GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 

GCCCTGACAGTGTTTGGCTQAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC 

CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 

CTCAATAAACGTGGTTAAGAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAA 
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FUGTOE 29(0) 

MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEA^^ 
SLG I PVNHL I EGSQKCVAELGPQAVGAVKALKALLGALTVFG 
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OTTCTIRE 291 



PCTAJS99A2252 



TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 
CCTT(^CCTCCGCAGCCGATCACASGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 
GCACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 
TCAGAACCAGACCAGCAGGGTAGT GC AGGCTCCCAGGGAGGAAGAGGAAGAT GAGC AGGAGG 
CCAGCGAGGAGAAGGC CGGTGAGGAAGAGAAAGC CTGGCTGATGG C CAG CAGGC AG CAGCTT 
GCCAAGGAGACTTCAAACTTCGGATT CAGC CTGCTG CGAAAGAT CT CC ATGAGG CACGATGG 
CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA 
CAGGGCCGACTGAAAC CCAGAT CAAGAGAGGG CT CC ACTTGCAGGC CCTGAAGCCCACCAAG 
CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 
GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 
TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 
CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 
GTTTGATGAGATTAAT CCTGAAACCAAATTAATT CTTGTGGATTACATCTTGTT CAAAGGGA 
AATGGTTGAC CC CATTTGAC CC TGTCTT CACCGAAGT CGACACTTTCCAC CT GGACAAGTAC 
AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 
TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 
TGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACA 
TGGCTCAGAAACATGAAAACCAGAAACATGGAAGT T TT CTTT CCGAAGTT CAAG CTAGATCA 
GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 
CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 
ACAGTGATTGAAGTTGATGAAAGGGG CACTGAGGCAGTGGCAGGAATCTTGT CAGAAATTAC 
TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 
AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGG 
ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 
TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 
AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 
CTGAAACGGTGGACAGTG CTGAAC CTTATATATATTTTTT CCTACACATACATACCT ATGAT 
AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 
TGAGTTACTTGAACGCAAGC ACTG CAAT AC CATAACAGTCAAACTGATTATAGAGAAGGCTA 
CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 
CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 
TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAAT GTTTTTGGAC CATGGT 
TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 
GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
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mourn 292 

MKWPSLLLSVLLAQVWLVPGLAPSPQSPETPAPQNQTSRWQAPREEEEDEQEASEEKAGE 
EE KAWLMASRQQLAKETSNFGFSLLRK I SMRHDGNMVF S P FGMS LAMTGLMLGATG PTETQ I 
KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 
DTECVPMNFRNASQ AKRLMNHY INKETRGK I PKLFDE I NPET KL I LVD Y I LFKGKWLTPFDP 
VFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKMGDHL 
ALEDYLTTDLVETWLRNMKTRNMEVFFPKFKLDQKYEMHELLRQMGIRRIFSPFADLSELSA 
TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLF 
LGRWNPTLL 
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FIGURE 293 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCTCTACAGAGACGCGGAC C C CAGACATGAG 
GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTCCCAG 
CACCCAAGGTCCCTATCAAGATGCAAGTCAAACACTGGCCCTCAGAGCAGGACCCAGAGAAG 
GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 
TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 
TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC 
GAGC CCGACC ATGACAGC CTGTACCACC CT CCG CCTGAGGAGGACCAGGGCGAGGAGAGGCC 
CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 
TCTACCACCCCCAGSASGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG 
CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 
AAAAAAAAAAAAAAAA 
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MRRLLLVTSLVWLLWEAGAVPAP KVP I KMQVKHWP SEQDPEKAWGARWEP PEKDDQLWL 
FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 

RPRLWVMPNHQVLLGPEEDQDHIYHPQ 
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FIGURE 295 

AGAAAGCTGCACT CTGTTGAGCT CCAGGGCGCAG TGGAGGGAGGGAGTGAAGGAGC TC T C TG 
TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATOAACCAACTCAGCTTCCTGC 
TGTTTCTC ATAG CG ACCACC AGAGGATGGAGTACAGATGAGGCTAATACTTACTT CAAGGAA 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 
GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 
CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGT C AGCAGGG CAGCAAAGCAGACTAC C C 
AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 
ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 
CCCAATAAGTCCCCCATGCAGCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 
TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAGAAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 
TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 
CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGTTG^GAGTTTTGTG 
GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GT AG CT AG AATGTT AAT GG C AGAAG AGAAAAC AAT AAAT C AT ATTGA CT CAAGAAAAAAA 
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FIGURE 296 

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGI 
YQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN 
ALCAGMRVTGCNTEHHCIGGGGYFPEASPQQCG^^ 
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3o-f/3vo 
FUCUIRE 291 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 
CACGAGGCTGCCGCATCCTGCCCTCGGAACAATGGGACTCGGCGCGCGAGGTGCTTGGGCCG 
CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 
ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCCAGTGCTAACTCAAC 
AGAGACT CTC CAACATGTGCCTTCTGAC CATACAAATGAAACTT C CAACAGT ACTGTGAAAC 
CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAACGGTCACCACCATGAAACCTACAGCG 
GCATCTAATACAACAACACC AGGGATGGTCT C AACAAATATGACTTCTACCACCTTAAAGT C 
TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAATGACCG 
TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 
TCTGAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 
GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 
GGTATCGAAC CATAGATGAACATGATGCCATCATTTAAGGAAAT CC ATGG AC CAAGGATGGA 
ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 
CTTTTT GAAAATAGTATAAACAGGCCATGCATATAATGTACAGT GTATTACGTAAATATGTA 
AAGATT CTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACAT CTGG AT CTTATAGACCGT 
T CATACAATGGTTTTAG CAAGTTCATAGTAAGAC AAACAAGT CCTATCTTTTTTTTTTGG CT 
GGGGTGGGGGCATTGGTCACATATGACCAGTAATTG AAAG AC GT CATCACTGAAAGACAGAA 
TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 
AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 
CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 
TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
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MGLGARGAWAALLLGTLQViJUjiiGAAHESAAIvL^SANIENSGLPHNSSANSTETLQW 
TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 
TSQ I STSTMTVTHNSS VTSAAS SVT I TTTMHS EAKKGS KFDTGS FVGG I VLT LGVLS ILYIG 
CKMYYSRRGIRYRTIDEHDAI I 
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FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 
TCGCGGGGGCTCCGGGCTGTGGGACCGCTGGGCCCCCAGCG&TSGCGACCCTGTGGGGAGGC 
CTTCTTCGGCTTGGCTCCTTGCTCAGCCTGTCGTGCCTGGCGCTTTCCGTGCTGCTGCTGGC 
GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCTCCCT 
ATAAAGAAAATTCTGGGCATATTT AT AATAAGAACATAT CT CAGAAAGATTGTGATTGC CTT 
CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 
ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACCATTATAATTTATCTCTCCA 
TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 
CGCCTCTTTGGACATGCACAGTTGAT ACAGAGTGATGATGATATTGGGGATCAC CAGC CTTT 
TGCAAATG CACACGATGTGCTAGC CCGCTCC CGCAGTCGAG C CAACGTGC TGAAGAAGGTAG 
AATATGCACAGCAG CGCTGGAAGCTT CAAGTCCAAGAGCAGCGAAAGT CTGT CTTTGAC CG G 
CATGTTGTC CTCAG CTA&TTGGGAATTGAATT CAAGGTGACTAGAAAGAAACAGGCAGAC AA 
CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 
GTTGOTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 
ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 
TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 
AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 
TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 
AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 
TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACTTACTCATTTG 
ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 
CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 
CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 
GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 
TG CAATAAAGAAATTTTATTTTAAAC CCAAGCCTCC CTGG ATTGAT AATATATACACATTTG 
TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 
AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 
T CTT C CTATGTCCTCrTTGGAATGTAACAATAAAAATAATTTTTGAAACATCAA 
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MATLWGGLLRLGSLLSLSCLALSVLLLAQ 

QKDCDCLHWEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTI I IYLS ILGLLLLYMVYLTL 
VEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSR5RANVLNKVEYAQQRWKLQVQEQ 

RKSVFDRHWLS 
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FIGURE 3(0)1 

GCACCTGCGACCACCGTGAGCAGTC&SSGCGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT 
CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCTTCCTGTCCCGCGGGAAGCGG 
CAGGAGCCG CCG C CGACACCTGAAGGAAAATTGGGCCGATTTCCACCTATGATGC AT CATCA 
CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 
CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 
ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTACATTCTATT 
TAAGGTAAGTAGAATCATC CTAATC ATATTACATCAAJQAAAAT CTAATATGGCGATAAAAA 
TCATTGTCTACATTAAAACTTCTTATAGTTCATAAAATTATTTCAAATCCATCATCTCTTTA 
AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 
TTTACTCAATGTTTAAGTGTTTTGC CC CAAAATT C ACAACT AACAAGGCAGAACTAGGACTT 
GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATACAATTCAATGCACTCCCCTGCCA 
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MAYSTVQRVALASGLVLALSL^^ 

PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQ 1 1 P I YGFG I FLY I L Y I LFKVSR 1 1 L I 
ILHQ 
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FIGURE 303 



CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 
GGATTTGAAAGTTGAGAGCAGCATGTTTTGC CCACTGAAACTCATCCTGCTGC CAGTGTT>.C 
TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 
GGTGATTCAG CTCTGATGGGATGTGTTTTC CAGAGCACAGAAGACAAATGTATATTCAAGAT 
AGACTGGACT CTGTCACCAGGAGAGCACGC CAAGGACGAAT ATGTGCTATACTATTACTCCA 
ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 
AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 
AATC CGCCTCAAAGGGGAGAGC CAGGTGTTCAAGAAGGCGGTGGTACTGCATGTGC TT CCAG 
AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 
AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 
GG AGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCC AG AGCTGGG 
GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 
CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 
GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 
CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 
TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 
GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAG 
AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 
CGGGAGGTGATCGAGGAAGAAGAAC CAAGTGAAAAAT CAGAGG CCAC CTACATGACCATGCA 
CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 
GAATGCCAAAAACACAGCAAGCCTTTTSAGAAGAATGGAGAGTCCCTTCATCTCAGCAGCGG 
TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 
CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 
CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 
CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 
GG AT CAGACCCTCCTGTGGGCAGGGTTCTTAGTGGAT GAGT TACTGGGAAGAATCAGAGAT A 
AAAACCAACCCAAATCAA 
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FIGURE 3(0)4 

MFCPLKLILLPVLLDYSLGI^DLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLSP 

EHAKDEYVLYYYSNLS VP I GRFQNRVHLMGD I LCNDGS LLLQDVQEADQGT Y I CE IRLKGE S 

QVFKKAWLHVLPEEPKEL^WHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFR^ 

HKLRMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIV 

LHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTV 

LVKNTKKTNPE I KEKPCHFERCEGEKHI YS P 1 1 VRE V I EEEE PS EKSEATYMTMHPVWPSLR 

SDRNNSLEKKSGGGMPKTQQAF 
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FIGURE 3(D>S 

CT ATGAAGAAGCTTCCTGGAAAACAATAAG CAAAGG AAAACAAATG TGT C CC ATCT CACATG 
GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 
AAACTACATTTTGCAAAGTCATTGAACTCTGAGCTCAGTTGCAGTACTCGGGAAGCCATOCA 
GGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAAC CAGCT CT C GT CT C CGTTG 
GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 
ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 
TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 
TAAAAC AATCAGAACTAAAGGG CACTTT CAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 
TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 
TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 
AGTACATCAAAG C C AGGACTCATTTAATTCGTTGGGTCGGATTATCTC GC CAGAAGTCGAAT 
GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 
TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 
AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 
TAA TGCAAAGAGGTGGACAGGATAACft,CAGATAAGGGCTTTATTGTACAATAAAAGATATGT 
ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 



CO: 
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FUGURE 3(Q>6 

MQDEDGYITI^IKTRKPALVSVGPASSSWWR^ 

QDENENRTGTLQQLAKRFCQYVWQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWE 
ESKQYCTDmATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYIjMCERKAGMTKVDQLP 
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